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10 

FIELD OF THE INVENTION 

15 

The present invention relates to transgenic non-human animals that are engineered to contain 
human immunoglobulin gene loci. In particular, animals in accordance with the invention possess 
human Ig loci that include plural variable (V H and Vic) gene regions. Advantageously, the inclusion 
of plural variable region genes enhances the specificity and diversity of human antibodies produced 
by the animal. Further, the inclusion of such regions enhances and reconstitutes B-cell development 
to the animals, such that the animals possess abundant mature B-cells secreting extremely high affinity 
antibodies. 

BACKGROUND OF THE TE CHNOLOGY 

The ability to clone and reconstruct megabase-sized human loci in YACs and to introduce 
them into the mouse germline provides a powerful approach to elucidating the functional components 
of very large or crudely mapped loci as well as generating useful models of human disease. 
Furthermore, the utilization of such technology for substitution of mouse loci with their human 
equivalents could provide unique insights into the expression and regulation of human gene products 
during development, their communication with other systems, and their involvement in disease 
induction and progression. 
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An important practical application of such a strategy is the "humanization" of the mouse 
humoral immune system. Introduction of human immunoglobulin (Ig) loci into mice in which l he 
endogenous Ig genes have been inactivated offers the opportunity to study of the mechanisms 
^underlying programmed expression and assembly of antibodies as well as their role in B-cell 
5 development. Furthermore, such a strategy could provide an ideal source for production of fully 
human monoclonal antibodies (Mabs) - an important milestone towards fulfilling the promise of 
antibody therapy in human disease. Fully human antibodies are expected to minimize the 
immunogenic and allergic responses intrinsic to mouse or mouse-derivatized Mabs and thus to 
increase the efficacy and safety of the administered antibodies. The use of fully human antibodies can 
10 be expected to provide a substantial advantage in the treatment of chronic and recurring human 
diseases, such as inflammation, autoimmunity, and cancer, which require repeated antibody 
administrations. 

One approach towards this goal was to engineer mouse strains deficient in mouse antibody 
production with large fragments of the human Ig loci in anticipation that such mice would produce 

1 5 a large repertoire of human antibodies in the absence of mouse antibodies. Large human Ig fragments 
would preserve the large variable gene diversity as well as the proper regulation of antibody 
production and expression. By exploiting the mouse machinery for antibody diversification and 
selection and the lack of immunological tolerance to human proteins, the reproduced human antibody 
repertoire in these mouse strains should yield high affinity antibodies against any antigen of interest, 

20 including human antigens. Using the hybridoma technology, antigen-specific human Mabs with the 
desired specificity could be readily produced and selected. 

This general strategy was demonstrated in connection with our generation of the first 
XenoMouse™ strains as published in 1994. See Green et al. Nature Genetics 7:13-21 (1994). The 
XenoMouse™ strains were engineered with 245 kb and 190 kb-sized germline configuration 

25 fragments of the human heavy chain loci and kappa light chain loci, respectively, which contained 
core variable and constant region sequences. Id The human lg containing yeast artificial 
chromosomes (YACs) proved to be compatible with the mouse system for both rearrangement and 
expression of antibodies, and were capable of substituting for the inactivated mouse Ig genes. This 
was demonstrated by their ability to induce B-cell development and to produce an adult-like human 
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repertoire of fully human antibodies and to generate antigen-specific human Mabs. These results also 
suggested that introduction of larger portions of the human Ig loci containing greater numbers of V 
genes, additional regulatory elements, and human Ig constant regions might recapitulate substantially 



07/466,008, filed January 12, 1990, 07/610,515, filed November 8, 1990, 07/919,297, filed July 24, 
1992, 07/922,649, filed July 30, 1992, filed 08/031,801, filed March 15,1993, 08/112,848, filed 
August 27, 1993, 08/234,145, filed April 28, 1994, 08/376,279, filed January 20, 1995, 08/430, 938, 
April 27, 1995, 08/464,584, filed June 5, 1995, 08/464,582, filed June 5, 1995, 08/463, 191, filed June 
10 5, 1995, 08/462,837, filed June 5, 1995, 08/486,853, filed June 5, 1995, 08/486,857, filed June 5, 
1995, 08/486,859, filed June 5, 1995, 08/462,513, filed June 5, 1995, and 08/724,752, filed October 
2, 1996. See also European Patent No., EP 0 463 151 Bl, grant published June 12 r 1996, 
International Patent Application No., WO 94/02602, published February 3, 1994, International Patent 
Application No., WO 96/34096, published October 31, 1996, and PCT Application No. 
15 PCT/US96/05928, filed April 29, 1996. The disclosures of each of the above-cited patents and 
applications are hereby incorporated by reference in their entirety. 

In an alternative approach, others, including GenPharm International, Inc., have utilized a 
"minilocus" approach. In the minilocus approach, an exogenous Ig locus is mimicked through the 
inclusion of pieces (individual genes) from the Ig locus. Thus, one or more V H genes, one or more 
20 D H genes, one or more J H genes, a mu constant region, and a second constant region (preferably a 
gamma constant region) are formed into a construct for insertion into an animal. This approach is 
described in U.S. Patent No. 5,545,807 to Surani et al. and U.S. Patent Nos. 5,545,806 and 
5,625,825, both to Lonberg and Kay, and GenPharm International U.S. Patent Application Serial 
Nos. 07/574,748, filed August 29, 1990, 07/575,962, filed August 31, 1990, 07/810,279, filed 
25 December 17, 1991, 07/853,408, filed March 18, 1992, 07/904,068, filed June 23, 1992, 07/990,860, 
filed December 16, 1992, 08/053,131, filed April 26, 1993, 08/096,762, filed July 22, 1993, 
08/155,301, filed November 18, 1993, 08/161,739, filed December 3, 1993, 08/165,699, filed 
December 10, 1993, 08/209,741, filed March 9, 1*94, the disclosures of which are hereby 
incorporated by reference. See also International Patent Application Nos. WO 94/25585, published 



5 



the full repertoire that is characteristic of the human humoral response to infection and immunization. 
Such approach is further discussed and delineated in U.S. Patent Application Serial Nos. 
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November 10, 1994, WO 93/12227, published June 24, 1993, WO 92/22645, published December 
23, 1992, WO 92/03918, published March 19, 1992, the disclosures of which are hereby incorporated 
by reference in their entirety. See further Taylor et al., 1992, Chen et al., 1993, Tuaillon et al., 1993, 
Choi et al., 1993, Lonberg et al., (1994), Taylor et al., (1994), and Tuaillon et al., (1995), the 
5 disclosures of which are hereby incorporated by reference in their entirety. 

The inventors of Surani et al., cited above, and assigned to the Medical Research Counsel (the 
"MRC"), produced a transgenic mouse possessing an Ig locus through use of the minilocus approach. 
The inventors on the GenPharm International work, cited above, Lonberg and Kay, following the lead 
of the present inventors, proposed inactivation of the endogenous mouse Ig locus coupled with 
10 substantial duplication of the Surani et al. work. 

An advantage of the minilocus approach is the rapidity with which constructs including 
portions of the Ig locus can be generated and introduced into animals. Commensurately, however, 
a significant disadvantage of the minilocus approach is that, in theory, insufficient diversity is 
introduced through the inclusion of small numbers of V, D, and J genes. Indeed, the published work 
1 5 appears to support this concern. B-cell development and antibody production of animals produced 
through use of the minilocus approach appear stunted. Therefore, the present inventors have 
consistently urged introduction of large portions of the Ig locus in order to achieve greater diversity 
and in an effort to reconstitute the immune repertoire of the animals. 

Accordingly, it would be desirable to provide transgenic animals containing more complete 
20 germline sequences and configuration of the human Ig locus. It would be additionally desirable to 
provide such locus against a knockout background of endogenous Ig. 

Summary of the Invention 
Provided in accordance with the present invention are transgenic animals having a near 
25 complete human Ig locus, including both a human heavy chain locus and a human kappa light chain 
locus. Preferably, the heavy chain locus includes greater than about 20%, more preferably greater 
than about 40%, more preferably greater than about 50%, and even more preferably greater than 
about 60% of the human heavy chain variable region. In connection with the human kappa light 
chain, preferably, the locus includes greater than about 20%, more preferably greater than about 40%, 
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more preferably greater than about 50%, and even more preferably greater than about 60% of the 
human kappa light chain variable region. Such percentages preferably refer to percentages of 
functional variable region genes. 

Further, preferably such animals include the entire D H region, the entire J H region, the human 
5 mu constant region, and can additionally be equipped with genes encoding other human constant 
regions for the generation of additional isotypes. Such isotypes can include genes encoding y u y 2i 
Y* a i € > P> ot her constant region encoding genes. Alternative constant regions can be included 
on the same transgene, i.e., downstream from the human mu constant region, or, alternatively, such 
other constant regions can be included on another chromosome. It will be appreciated that where 

10 such other constant regions are included on the same chromosome as the chromosome including the 
human mu constant region encoding transgene, cis-switching to the other isotype or isotypes can be 
accomplished. On the other hand, where such other constant region is included on a different 
chromosome from the chromosome containing the mu constant region encoding transgene, trans- 
switching to the other isotype or isotypes can be accomplished. Such arrangement allows tremendous 

1 5 flexibility in the design and construction of mice for the generation of antibodies to a wide array of 
antigens. 

Preferably, such mice additionally do not produce functional endogenous immunoglobulins. 
This is accomplished in a preferred embodiment through the inactivation (or knocking out) of 
endogenous heavy and light chain loci. For example, in a preferred embodiment, the mouse heavy 

20 chain J-region and mouse kappa light chain J-region and C K -region are inactivated through utilization 
of homologous recombination vectors that replace or delete the region. Such techniques are 
described in detail in our earlier applications and publications. 

Unexpectedly, transgenic mice in accordance with the invention appear to possess an almost 
entirely reconstituted immune system repertoire. This is dramatically, demonstrated when four 

25 separate mouse strains are compared: a first strain contains extensive human heavy chain variable 
regions and human kappa light chain variable regions and encodes only a mu isotype, a second strain 
contains extensive human heavy chain variable regions and human kappa light chain variable regions 
and encodes a mu and gamma-2 isotypes, a third strain contains significantly less human heavy and 
kappa light chain variable regions, and a fourth strain contains a double-inactivated mouse Ig locus. 
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The first and second strains undergo similar, if not identical, B-cell development, whereas the third 
strain has a reduced development and maturation of B-cells, and the fourth strain contains no mature 
B-cells. Further, it is interesting to note that production of human antibodies in preference to mouse 
% " antibodies is substantially elevated in mice having a knock-out background of endogenous Ig. That 
5 is to say that mice that contain a human Ig locus and a functionally inactivated endogenous Ig 
produce human antibodies at a rate of approximately 100 to 1000 fold as efficiently as mice that 
contain only a human Ig locus. 

Thus, in accordance with a first aspect of the present invention there is provided a transgenic 
non-human mammal having a genome, the genome comprising modifications, the modifications 
10 comprising: an inactivated endogenous immunoglobulin (Ig) locus, such that the mammal would not 
display normal B-cell development; an inserted human heavy chain Ig locus in substantially germline 
configuration, the human heavy chain Ig locus comprising a human mu constant region and regulatory 
and switch sequences thereto, a plurality of human J H genes, a plurality of human D H genes, and a 
plurality of human V H genes; and an inserted human kappa light chain Ig locus in substantially 
15 germline configuration, the human kappa light chain Ig locus comprising a human kappa constant 
region, a plurality of Jk genes, and a plurality of Vk genes, wherein the number of V H and Vk genes 
inserted are selected to substantially restore normal B-cell development in the mammal. In a preferred 
embodiment, the heavy chain Ig locus comprises a second constant region selected from the group 
consisting of human gamma- 1, human gamma-2, human gamma-3, human gamma-4, alpha, delta, and 
20 epsilon. In another preferred embodiment, the number of V H genes is greater than about 20. In 
another preferred embodiment, the number of Vk genes is greater than about 15. In another 
preferred embodiment, the number of D H genes is greater than about 25, the number of J H genes is 
greater than about 4, the number of V H genes is greater than about 20, the number of Jk genes is 
greater than about 4, and the number of Vk genes is greater than about 15. In another preferred 
25 embodiment, the number of D H genes, the number of J H genes, the number of V H genes, the number 
of Jk genes, and the number of Vk genes are selected such that the Ig loci are capable of encoding 
greater than about 1 x 10 5 different functional antibody sequence combinations. In a preferred 
embodiment, in a population of mammals B-cell function is reconstituted on average to greater than 
about 50% as compared to wild type. 
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In accordance with a second aspect of the present invention there is provided an improved 
transgenic non-human mammal having a genome that comprises modifications, the modifications 
rendering the mammal capable of producing human immunoglobulin molecules but substantially 
incapable of producing functional endogenous immunoglobulin molecules, the improvement 
5 comprising: insertion into the genome of the mammal of sufficient human V H , D H , J Hl Vic, and Jk 
genes such that the mammal is capable encoding greater than about 1 x 1 0 6 different functional human 
immunoglobulin sequence combinations. 

In accordance with a third aspect of the present invention, there is provided an improved 
transgenic non-human mammal having a genome that comprises modifications, the modifications 
10 rendering the mammal capable of producing human immunoglobulin molecules but substantially 
incapable of producing functional endogenous immunoglobulin molecules, which modifications, with 
respect to the mammal's incapacity to produce functional endogenous immunoglobulin molecules 
would not allow the mammal to display normal B-cell development, the improvement comprising: 
insertion into the genome of the mammal of sufficient human V H , D H , J H , Vk, and Jk genes such that 
15 the mammal is capable of encoding greater than about 1 x 10 6 different functional human 
immunoglobulin sequence combinations and sufficient V H and Vk genes to substantially restore 
normal B-cell development in the mammal In a preferred embodiment, in a population of mammals 
B-cell function is reconstituted on average to greater than about 50% as compared to wild type. 

In accordance with a fourth aspect of the present invention, there is provided a transgenic 
20 non-human mammal having a genome, the genome comprising modifications, the modifications 
comprising: an inactivated endogenous heavy chain immunoglobulin (Ig) locus; an inactivated 
endogenous kappa light chain Ig locus; an inserted human heavy chain Ig locus, the human heavy 
chain Ig locus comprising a nucleotide sequence substantially corresponding to the nucleotide 
sequence of yH2; and an inserted human kappa light chain Ig locus, the human kappa light chain Ig 
25 locus comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

In accordance with a fifth aspect of the present invention there is provided a transgenic non- 
human mammal having a genome, the genome comprising modifications, the modifications 
comprising: an inactivated endogenous heavy chain immunoglubulin (Ig) locus; an inserted human 
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heavy chain Ig locus, the human heavy chain Ig locus comprising a nucleotide sequence substantially 
corresponding to the nur^otide sequence of yH2; and an inserted human kappa light chain Ig locus, 
the human kappa light chain Ig locus comprising a nucleotide sequence substantially corresponding 
s to the nucleotide sequence of yK2, 
5 In accordance with a sixth aspect of the present invention, there is provided a transgenic non- 

human mammal having a genome, the genome comprising modifications, the modifications 
comprising: an inactivated endogenous heavy chain immunoglubulin (Ig) locus; an inactivated 
endogenous kappa light chain Ig locus; an inserted human heavy chain Ig locus, the human heavy 
chain Ig locus comprising a nucleotide sequence substantially corresponding to the nucleotide 

10 sequence of yH2 without the presence of a human gamma-2 constant region; and an inserted human 
kappa light chain Ig locus, the human kappa light chain Ig locus comprising a nucleotide sequence 
substantially corresponding to the nucleotide sequence of yK2. 

I n accordance with a seventh aspect of the present invention, there is provideA 
transgenic non-human mammal having a genome, the genome comprising modifications, the 

15 modifications comprising: an inactivated endogenous heavy chain immunoglubulin (Ig) locus; an 
inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising a nucleotide 
sequence substantially corresponding to the nucleotide sequence of yH2 without the presence of a 
human gamma-2 constant region; and an inserted human kappa light chain Ig locus, the human kappa 
light chain Ig locus comprising a nucleotide sequence substantially corresponding to the nucleotide 
20 sequence of yK2. 

In accordance with an eighth aspect of the present invention, there is provided a method for 
the production of human antibodies, comprising: inoculating any of the mammals of the first through 
fifth aspects of the invention discussed above with an antigen; collecting and immortalizing 
lymphocytic cells to obtain an immortal cell population secreting human antibodies that specifically 
25 bind to the antigen with an affinity of greater than 10 9 M' ! ; and isolating the antibodies from the 
immortal cell populations. 

In a preferred embodiment, the antigen is DL-8. In another preferred embodiment, the antigen 
is EGFR. In another preferred embodiment, the antigen is TNF-a. 

In accordance with a ninth aspect of the present invention, there is provided an antibody 
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produced by the method of the sixth aspect of the invention, including antibodies to IL-8, EGFR, and 



In accordance with a tenth aspect of the present invention, there is provided an improved 
method for the production of transgenic mice, the transgenic mice having a genome, the genome 
5 comprising modifications, the modifications comprising insertion of a plurality of human variable 
regions, the improvement comprising: insertion of the human variable regions from a yeast artificial 
chromosome. 

In accordance with an eleventh aspect of the present invention, there are provided transgenic 
mice and transgenic offspring therefrom produced through use of the improvement of the eighth 

1 0 aspect of the present invention. 

In accordance with a twelfth aspect of the present invention, there is provided a transgenic 
mammal, the transgenic mammal comprising a genome, the genome comprising modifications, the 
modifications comprising an inserted human heavy chain immunoglobulin transgene, the improvement 
comprising: the transgene comprising selected sets of human variable region genes that enable 

15 human-like junctional diversity and human-like complementarity determining region 3 (CDR3) 
lengths. In a preferred embodiment, the human-like junctional diversity comprises average N-addition 
lengths of 7.7 bases. In another preferred embodiment, the human-like CDR3 lengths comprise 
between about 2 through about 25 residues with an average of about 14 residues. 

20 BRIEF DESC RIPTION OF THE DRAWING FIGURES 



kappa light chain loci YACs introduced into preferred mice in accordance with the invention. YACs 
spanning the human heavy chain (1H, 2H, 3H, and 4H) and the human kappa light chain proximal 
(IK, 2K, and 3K) loci were cloned from human -YAC libraries. The locations of the different YACs 
25 with respect to the human Ig loci (adopted from Cook and Tomlinson, 1995, and Cox et al., 1994), 
their sizes, and non-Ig sequences are indicated (not shown to scale). The YACs were recombined 
into yeast in a two-step procedure (see Materials and Methods) to reconstruct the human heavy and 
kappa light chain YACs. yH2, the human heavy chain containing YAC, was further retrofitted with 
a human y 2 gene sequence. yK2, was the human kappa light chain containing YAC. The YAC vector 



TNF-a. 



Figure 1 is a schematic representation of the reconstructed human heavy chain and human 
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elements: telomere A, centromere •, mammalian (HPRT, Neo) and yeast selectable markers 
(TRP1, ADE2, LYS2, LEU2, l T RA3, HIS3) on the YAC vector arms are indicated. V H segments 
are classified as genes with open reading frame pseudogenes □, and unsequenced genes O. V K 
segments are classified as genes with open reading frames •, and pseudogenes □. The V genes that 
5 we hav^ found to be utilized by the XenoMouse II are marked (*). The V H gene region contained 
on yH2 is marked by arrows. 

Figure 2 is a series of Southern Blot analyses and characterizations of the human heavy chain 
YAC, yH2, integrated in ES cells and in XenoMouse strains. Figure 2a is a series of Southern Blot 
analyses of EcoRI (a, c) and Bamffl (b, d, e) digested DNA (2ug) prepared from the CGM1 
10 immortalized B-lymphoblast cell line derived from the Washington University YAC library source 
(Brownstein et al., 1989), yH2 YAC (0.5 ug YAC added to 2 ug of 3B1 DNA), unmodified 
E14TG.3B1 (3B1), and yH2-containing ES cell lines: L10, J9.2, L18, L17, and J17. The probes used 
for blotting were human V H 1 (a), D H (b) [18 kb fragment in CGM1 lane represents D segments on 
chromosome 16], V H 3 (c), Cu (d) and J H (e). Figure 2b is a series of Southern Blot analyses of 
1 5 EcoRI (a-b) and BamHI (c-d) digested DNA (10 ug) that was prepared from the tails of wiidtype 
(WT, 129xB57BL/6J), XM2A-1, and XM2A-2 (2 individual offspring) mice or from the parental 
yH2-containing ES cell lines L10 (slightly underloaded relative to other samples), J9.2, and 
yK2-containing ES cell line J23. 1 . The probes used were human V H 1 (a), V !r 4 (b), human y-2 (c), 
and mouse 3'-enhancer (d, the 5kb band represents the endogenous mouse 3'-enhancer fragment). 
20 Fragment sizes of molecular weight markers (in kb) are indicated. 

Figure 3 is a series of Southern Blot analyses characterizing the human kappa light chain 
YAC, yK2, integrated in ES cells and in XenoMouse 2A Strains Figure 2a is a series of Southern 
Blot analyses of EcoRI (a, c, d) and BamHI (b, e ) digested DNA (2 ug) prepared from CGM1 cell 
line (Brownstein et al., 1989, supra), yK2 YAC (0.5 ug YAC DNA added to 2 ug of 3B1 DNA), 
25 unmodified E14TG.3B1 (3B1), and yK2-containing ES cell lines: J23. 1 and J23.7. The probes used 
were human Va (a), Kde (b), V r II (c), V K III (d), and C K (e). Figure 2b is a series of Southern Blot 
analyses of EcoRI-digested DNA (2 ug) that was prepared from the tails of wiidtype (WT, 129xB6), 
XM2A-1, and XM2A-2 (2 individual offspring) mice or from the parental yH2-containing ES cell 
lines L10 (slightly underloaded relative to other samples), J9.2, and yK2-containing ES cell line J23. 1 . 
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The probes that were used were human V K I (a), V K IV (b), V K VI (c) and 3-enhancer (d). Fragment 
sizes of molecular weight markers (in kb) are indicated. 

Figure 4 shows B-cell reconstitution and surface expression of human |i, 8, and k chains on 
XenoMouse-derived B-cells and shows flow cytometry analysis of peripheral blood (Fig. 4a) and 
5 spleen (Fig. 4b) lymphocytes from wildtype mice (WT), double inactivated mice (DI), and 
XenoMouse strains 2A-1 and 2A-2 (XM2A-1, XM2A-2). Four-color flow cytometry analysis was 
carried out using antibodies to the B-cell-specific marker B220 in combination with anti-human \i 9 
6, k, or mouse \i, 6, k, or A. The percentage of positively-stained cells is shown in each quadrant. 
Isolation and staining of cells were performed as described in Materials and Methods. Populations 
10 of human k + and mouse A* cells were determined after first gating for B220>* populations in the 
indicated region. Populations of fT and 8 + cells were determined after first gating for B22(T cells. 
The percentage of positive cells within a region or quadrant is indicated. The FACS profiles shown 
are representative of several experiments performed on each of the strains. 



15 antigens to cells. Figure 5a shows the inhibition of labeled [I 125 ] EL-8 binding to human neutrophils 
by the mouse anti-human IL-8 antibody (R&D Systems) (□) and the fully human Mabs Dl. 1 (♦), 
K2.2 (•), K4.2 (A), and K4.3 (T). The background binding of labeled [I ,25 ]IL-8 in the absence of 
antibody was 2657 cpm. Figure 5b shows the inhibition of labeled [I I25 ]EGF to its receptors on A43 1 
cells by mouse anti-human EGFR antibodies 225 and 528 (□, v, respectively; Calbiochem) and the 

20 fully human antibodies El.l(#), E2.4 (A), E2.5 (T) and E2.1 1 (♦). The background binding of 
' [I I25 ]EGF in the absence of antibodies was 1060 cpm. Figure 5c shows inhibition of labeled [I 125 ] 
TNF-a binding to its receptors on U937 cells by the mouse anti-human TNF-a antibody (R&D 
Systems) (□) and fully human Mabs T22.1 (♦), T22.4 (•), T22.8 (A), and T22.9 (■). The 
background binding of [I 125 ]TNF-a in the absence of antibody was 401 0 cpm. Control human IgG 2 

25 myeloma antibody (a). 

Figure 6 shows repertoire and somatic hypermutation in XenoMouse-derived fully human 
Mabs. Predicted amino acid sequences of four anti-IL-8 (Fig. 6a) and four anti-EGFR (Fig. 6b) 
human IgG 2 K Mabs, divided into CDR1, CDR2 and CPR3 and the constant regions, C Y 2 and C r . The 
D and J genes of each antibody are indicated. The amino acid substitutions from the germline 



Figure 5 shows that XenoMouse-derived human antibodies block the binding of their specific 
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sequences are indicated in bold letters. 

Figure 7 is a schematic diagram of the human heavy chain genome and the human kappa light 
chain genome. 

Figure 8 is another schematic diagram showing the construction of the yH2 (human heavy 
chain) YAC. 

Figure 9 is another schematic diagram showing the construction of the yK2 (human kappa 
light chain) YAC. 

Figure 10 is another schematic diagram showing the construction of the yK2 (human kappa 
light chain) YAC. 

Figure 11 shows a series of Southern Blot analyses demonstrating integration intact of the 
yH2 (human heavy chain) YAC into ES cells and into the mouse genome. Detailed discussion is 
provided in connection with Figure 2. 

Figure 12 shows a series of Southern Blot analyses demonstrating integration intact of the 
yK2 (human kappa light chain) YAC into ES cells and into the mouse genome. Detailed discussion 
is provided in connection with Figure 3. 

Figure 13 shows B-cell reconstitution and surface expression of human fi, 6, and k chains and 
mouse X chains on XenoMouse-derived B-cells and shows flow cytometry analysis of peripheral 
blood. Further details are provided in connection with Figure 4. 

Figure 14 shows production levels of human antibodies by XenoMouse II strains in 
comparison to murine antibody production by wild type mice. 

Figure 15 is a repertoire analysis of human heavy chain transcripts expressed in XenoMouse 
II strains. 

Figure 16 is a repertoire analysis of human kappa light chain transcripts expressed in 
XenoMouse II strains. 

Figure 17 is another depiction of the diverse utilization of human V H and Vk genes that have 
been observed as utilized in XenoMouse II strains. 

Figure 18 shows the titers of human antibody production in XenoMouse II strains. 

Figure 19 is a depiction of gene utilization of anti-IL-8 antibodies derived from XenoMouse 
II strains. 
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Figure 20 shows heavy chain amino acid sequences of anti-EL-8 antibodies derived from 
XenoMouse II strains. 

Figure 21 shows kappa light chain amino acid sequences of anti-IL-8 antibodies derived from 
" XenoMouse II strains. 

Figure 22 shows blockage of IL-8 binding to human neutrophils by human anti-IL-8 
antibodies derived from XenoMouse II strains. 

Figure 23 shows inhibition of CD1 lb expression on human neutrophils by human anti-IL-8 
antibodies derived from XenoMouse II strains. 

Figure 24 shows inhibition of IL-8 induced calcium influx by human anti-IL-8 antibodies 
derived from XenoMouse II strains. 

Figure 25 shows inhibition of IL-8 RB/293 chemotaxsis by human anti-IL-8 antibodies 
derived from XenoMouse II strains. 

Figure 26 is a schematic diagram of a rabbit model of human EL-8 induced skin inflammation. 
Figure 27 shows the inhibition of human IL-8 induced skin inflammation in the rabbit model 
of Figure 26 with human anti-IL-8 antibodies derived from XenoMouse II strains. 

Figure 28 shows inhibition of angiogenesis of endothelial cells on a rat corneal pocket model 
by human anti-IL-8 antibodies derived from XenoMouse II strains. 

Figure 29 is a depiction of gene utilization of human anti-EGFR antibodies derived from 
XenoMouse II strains. 

Figure 30 shows heavy chain amino acid sequences of human anti-EGFR antibodies derived 
from XenoMouse II strains. 

Figure 31 shows blockage EGF binding to A431 cells by human anti-EGFR antibodies 
derived from XenoMouse II strains. 

Figure 32 shows inhibition of EGF binding to SW948 cells by human anti-EGFR antibodies 
derived from XenoMouse II strains. 

Figure 33 shows that human anti-EGFR antibodies derived from XenoMouse II strains inhibit 
growth of SW948 cells in vitro. 

Figure 34 shows inhibition of TNF-a binding to U937 cells through use of human anti-TNF-a 
antibodies derived from XenoMouse II strains. 
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Figure 35 shows kappa light chain amino acid sequences of human anti-EGFR antibodies 
derived from XenoMouse II strains. 



5 TOTALED Dpsc mraoN of the Preferred Embodiments 

Herein we describe the generation and characterization of several strains of mice containing 
substantially germline configuration megabase-sized human Ig loci. The present invention thus 
provides the first demonstration of reconstruction of the large and complex human Ig loci on YACs 
and the successful introduction of megabase-sized YACs into mice to functionally replace the 
1 0 corresponding mouse loci. 

Mouse Strains 

The following mouse strains are described and/or utilized herein: 

15 Double Inactivated (DP Strain: The DI strain of mice are mice that do not produce 

functional endogenous, mouse, Ig. In preferred embodiments, the DI mice possess an inactivated 
mouse J H region and an inactivated mouse Ck region. The construction of this strain is discussed 
extensively elsewhere. For example, the techniques utilized for generation of the DI strains are 
described in detail in U.S. Patent Application Serial Nos. 07/466,008, filed January 12, 1990, 

20 07/610,515, filed November 8, 1990, 07/919,297, filed July 24, 1992, 08/031,801, filed March 15, 
1993, 08/1 12,848, filed August 27, 1993, 08/234, 145, filed April 28, 1994, 08/724,752, filed October 
2, 1996. See also European Patent No, EP 0 463 151 Bl, grant published June 12, 1996, 
International Patent Application No., WO 94/02602, published February 3, 1994, International Patent 
Application No., WO 96/34096, published October 31, 1996, and PCT Application No. 

25 PCT/US96/05928, filed April 29, 1996. The disclosures of each of the above-cited patent and patent 
applications are hereby incorporated by reference in their entirety. It has been observed and reported 
that DI mice possess a very immature B-cell development. The mice do not produce mature B-cells, 
only pro-B-cells. 



- 14- 



WO 98/24893 



PCT7US97/23091 



XenoMouse I Strain : The design, construction, and analysis of the XenoMouse I strain was 
discussed in detail in Green et al., Nature Genetics, 7:13-21 (1994). Such mice pre .jced IgMic 
antibodies against a DI background. The mice showed improved B-cell function when compared to 
the DI strain of mice which have little to no B-cell development. While XenoMouse I strains of mice 
5 were capable of mounting a sizeable immune response to antigenic challenge, there appeared to be 
inefficient in their production of B-cells and possessed a limited response to different antigens which 
apparently was related to their limited V-gene repertoire. 

L6 Strain : The L6 strain is a mouse producing IgMic antibodies against a DI background 

10 of endogenous mouse Ig. L6 mice contain an inserted human heavy chain and an inserted human 
kappa light chain. The L6 strain is generated through breeding of a mouse containing a heavy chain 
insert against a double inactivated background (L6H) and a mouse having a kappa light chain insert 
against a double inactivated background (L6L). The heavy chain insert comprises an intact 
approximately 970 kb human DNA insert from a YAC containing approximately 66 V H segments, 

1 5 starting at Vj^-l and ending at V H 3-65, and including the major D gene clusters (approximately 32), 
J H genes (6), the intronic enhancer (Em), C\i, and through about 25 kb past C8, in germline 
configuration. The light chain insert comprises an intact approximately 800 kb human DNA insert 
from a YAC which contains approximately 32 V K genes starting at V K . B3 and ending at V^u. The 
800 kb insert contains a deletion of approximately 100 kb starting at V K . Lp _ 13 and ending at V K . Lp-5 . 

20 However, the DNA is in germline configuration from V K . Lp . 13 to 100 kb past V KOp .„ and also contains 
the J K genes, the intronic and 3* enhancers, the constant C K gene, and Kde. The L6H and L6L mice 
have been shown to access the full spectrum of the variable genes incorporated into their genome. 
It is expected that the L6 mice will similarly access the full spectrum of variable genes in their 
genome. Furthermore, L6 mice will exhibit predominant expression of human kappa light chain, a 

25 large population of mature B-cells, and normal levels of IgM* human antibodies. Such mice will 
mount a vigorous human antibody response to multiple immunogens, ultimately yielding 
antigen-specific fully human Mabs with subnanomolar affinities. 

XenoMous e Ha Strain: The XenoMouse Ila mice represent our second generation 
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XenoMouse™ strains equipped with germline configuration megabase-sized human Ig loci, against 
a DI background, such thv the mice do not produce functional endogenous Ig. Essentially, the mice 
are equivalent in construction to the L6 strain, but additionally include the human y2 gene with its 
entire switch and regulatory sequences and the mouse 3' enhancer in cis. The mice contain an 
5 approximately 1020 kb heavy and an approximately 800 kb kappa light chain loci, reconstructed on 
YACs, which include the majority of the human variable region genes, including heavy chain genes 
(approximately 66 Vh) and kappa light chain genes (approximately 32 VJ, human heavy constant 
region genes (|i, 5, and y) and kappa constant region genes (CJ, and all of the major identified 
regulatory elements. These mice have been shown to access the full spectrum of the variable genes 
10 incorporated into their genome. Furthermore, they exhibit efficient class switching and somatic 
hypermutation, predominant expression of human kappa light chain, a large population of mature B- 
cells, and normal levels of IgM K and lgG K human antibodies. Such mice mount a vigorous human 
antibody response to multiple immunogens, including human IL-8, human EGF receptor (EGFR), and 
human tumor necrosis factor-a (TNF-a), ultimately yielding antigen-specific fully human Mabs with 
15 subnanomolar affinities. This last result conclusively demonstrates XenoMouse™ as an excellent 
source for rapid isolation of high affinity, fully human therapeutic Mabs against a broad spectrum of 
antigens with any desired specificity. 

As will be appreciated from the above-introduction, the XenoMouse II strain appears to 
undergo mature B-cell development and mount powerful adult-human-like immune responses to 
20 antigenic challenge. The L6 strain, as predicted from the data in connection with L6L and L6H mice, 
also appear to undergo mature B-cell development and mount powerful adult-human-like immune 
responses to antigenic challenge. When DI mice are compared to XenoMouse I strains and DI and 
XenoMouse I strains are compared to L6 and XenoMouse II strains, a markedly different B-cell 
development profile is observed. Owing to this difference, it appears that the quantity and/or quality 
25 of variable region sequences introduced into the animals are essential to the induction B-cell 
maturation and development and the generation of an adult-human-like immune response. Thus, in 
addition to the strains' clear use in the generation of human antibodies, the strains provide a valuable 
tool for studying the nature of human antibodies in the normal immune response, as well as the 
abnormal response characteristic of autoimmune disease and other disorders. 



- 16- 



WO 98/24893 



PCT/US97/23091 



Variable Region - Quantitative Diversity 

It is predicted that *he specificity of antibodies (i.e., the ability to generate antibodies to a wide 
spectrum of antigens and indeed to a wide spectrum of independent epitopes thereon) is dependent 
upon the variable region genes on the heavy chain (Vh) and kappa light chain (VJ genome. The 
5 human heavy chain genome includes approximately 95 functional genes which encode variable regions 
of the human heavy chain of immunoglobulin molecules. In addition, the human light chain genome 
includes approximately 40 genes on its proximal end which encode variable regions of the human 
kappa light chain of immunoglobulin molecules. We have demonstrated that the specificity of 
antibodies can be enhanced through the inclusion of a plurality of genes encoding variable light and 
10 heavy chains. 

Provided in accordance with the present invention are transgenic mice having a substantial 
portion of the human Ig locus, preferably including both a human heavy chain locus and a human 
kappa light chain locus. In preferred embodiments, therefore, greater than 10% of the human V H and 
V K genes are utilized. More preferably, greater than about 20%, 30%, 40%, 50%, 60%, or even 70% 

1 5 or greater of V H and V K genes are utilized. In a preferred embodiment, constructs including 32 genes 
on the proximal region of the V K light chain genome are utilized and 66 genes on the V H portion of 
the genome are utilized. As will be appreciated, genes may be included either sequentially, i.e., in the 
order found in the human genome, or out of sequence, i.e., in an order other than that found in the 
human genome, or a combination thereof. Thus, by way of example, an entirely sequential portion 

20 of either the V H or V K genome can be utilized, or various V genes in either the V H or V K genome can 
be skipped while maintaining an overall sequential arrangement, or V genes within either the V H or 
V K genome can be reordered, and the like. In a preferred embodiment, the entire inserted locus is 
provided in substantially germline configuration as found in humans. In any case, it is expected and 
the results described herein demonstrate that the inclusion of a diverse array of genes from the V H and 

25 V K genome leads to enhanced antibody specificity and ultimately to enhanced antibody affinities. 

Further, preferably such mice include the entire D H region, the entire J H region, the human mu 
constant region, and can additionally be equipped with other human constant regions for the coding 
and generation of additional isotypes of antibodies. Such isotypes can include genes encoding y lt y 2 , 
y 3 , Y* a » e > and 8 and other constant region encoding genes with appropriate switch and regulatory 
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sequences. As will be appreciated, and as discussed in more detail below, a variety of switch and 
regulatory sequences can be appropriately utilized in connection with any particular constant region 
selection. 

The following Table indicates the diversity of antibody combinations that are possible in 
5 humans, based strictly on random V-D-J joining and combination with kappa light chains, without 
consideration of N-addition or somatic mutation events. Based on these considerations, there are 
greater than 3.8 million possible antibody combinations in humans, of any particular isotype. 

TABLE I 

10 



Region 


Heaw Chain 


Kappa Light Chain 


Variable "V" 


-95 


40 


Diversity "D" 


*32 




Joining T 


6 


5 


Combinations (VxDxJ) 


18,240 


200 


Total Combinations 
(HC Combinations x LC 
Combinations) 


3.65 X10 6 



20 In connection with a preferred embodiment of the invention, through the inclusion of about 

66 V H genes and 32 V K genes in a mouse with a full complement of D H , J H , and J K genes, the 
possible diversity of antibody production is on the order of 2.03 X 10 6 different antibodies. As 
before, such calculation does not take into account N-addition or somatic mutation events. 
Therefore, it will be appreciated that mice in accordance with the invention, such as the L6 and the 

25 XenoMouse II strains, offer substantial antibody diversity. In preferred embodiments, mice are 
designed to have the capability of producing greater than 1 X 10 6 different heavy chain V-D-J 
combinations and kappa light chain V-J combinations, without accounting for N-additions or somatic 
mutation events. 
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Variable Region - Qualitative Diversify 

In addition to quantitative diversity, quantitative selection of V-genes (i.e., large and diverse 
numbers of V-genes) and/or qualitative selection of V-genes (i.e., selection of particular V-genes) 
appears to play a role in what we refer to herein as "qualitative diversity." Qualitative diversity, as 
used herein, refers to diversity in V-D-J rearrangements wherein junctional diversity and/or somatic 
mutation events are introduced. During heavy chain rearrangement, certain enzymes (RAG-1, RAG- 
2, and possibly others) are responsible for the cutting of the DNA representing the coding regions of 
the antibody genes. Terminal deoxynucleotidyl transferase (Tdt) activity is upregulated which is 
responsible for N-terminal additions of nucleotides between the V-D and D-Jgene segments. Similar 
enzymes and others (SCID and other DNA repair enzymes) are responsible for the deletion that 
occurs at the junctions of these coding segments. With respect to junctional diversity, both N- 
addition events and formation of the complementarity determining region 3 (CDR2; are included 
within such term. As will be appreciated, CDR3 is located across the D region and includes the V-D 
and D-J junctional events. Thus, N-additions and deletions during both D-J rearrangement and V-D 
rearrangement are responsible for CDR3 diversity. 

It has been demonstrated that there are certain differences between murine and human 
junctional diversities. In particular, some researchers have reported that murine N-addition lengths 
and CDR3 lengths are generally shorter than typical human N-addition lengths and CDR3 lengths. 
Such groups have reported that, in humans, N-additions of about 7.7 bases in length, on average, are 
typically observed. Yamadaetal. (1991). Mouse-like N-additions are more often on the order of 
about 3 bases in length, on average. Feeneyetal. (1990). Similarly, human-like CDR3 lengths are 
longer than mouse-like CDR3*s. In man CDR3 lengths of between 2 and 25 residues, with an average 
of 14 residues, is common. In mice, some groups have reported shorter average CDR3 lengths. 

The junctional diversity created by N-additions and CDR3 additions play a clear role 
developing antibody specificity. 

In accordance with the invention, rearranged V-D-J gene sequences show N-addition lengths 
that are comparable to expected adult-human N-addition lengths. Further, amino acid sequences 
across the open reading frame (ORF) corresponding to CDR3 sequences show CDR3 lengths that 
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are comparable to expected adult-human CDR3 lengths. Such data is indicative that quantitative 
variable region diversity and/or qualitative variable region diversity results in human-like junctional 
diversity. Such junctional diversity is expected to lead to a more human-like antibody specificity. 

Variable Region - Affinities 

While we have not conclusively demonstrated a direct causal connection between the 
increased variable region inclusion and antibody specificity, it appears, and it is expected that through 
providing such diversity, the ability of the mouse to mount an immune response to a wide array of 
antigens is possible and enhanced. Additionally, such mice appear more equipped to mount immune 
responses to a wide array of epitopes upon individual antigens or immunogens. From our data it also 
appears that antibodies produced in accordance with the present invention possess enhanced affinities. 
Such data includes comparisons between mice in accordance with the invention and the XenoMouse 
I strains, as well as consideration of the published results of GenPharm International and the MRC. 
In connection with the XenoMouse I strains, as mentioned above, such mice possessed inefficient B- 
cell production and a limited response to different antigens. Such result appeared related in part to 
the limited V-gene repertoire. Similarly, results reported by GenPharm International and the MRC 
indicate a limited response to diverse antigens. 

Without wishing to bound to any particular theory or mode of operation of the invention, it 
would appear that enhanced affinities appear to result from the provision of the large number of V 
regions. From bur data, the provision of greater numbers and/or selection of qualities of V-gene 
sequences, enhances junctional diversity (N-additions and formation of complementarity determining 
region 3 ("CDR3") diversity), which is typical of an adult-human-like immune response, and which 
play a substantial role in affinity maturation of antibodies. It may also be that such antibodies are 
more effective and efficient in somatic mutation events that lead to enhanced affinities. Each of 
junctional diversity and somatic mutation events are discussed in additional detail below. 

With respect to affinities, antibody affinity rates and constants derived through utilization of 
plural V H and V K genes (i.e., the use of 32 genes on the proximal region of the V K light chain genome 
and 66 genes on the V„ portion of the genome) results in association rates (ka in M" l S* 1 ) of greater 
than about 0.50 X 10"*, preferably greater than 2.00 X 10" 6 , and more preferably greater than about 
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4.00 X 10**; dissociation rates (kd in S' 1 ) of greater than about 1.00 X 10" 4 , preferably greater than 
about 2.00 X 10" 4 , and more preferably greater than about 4.00 X 10" 4 ; and dissociation constant (in 
M) of greater than about 1 .00 X 10" 10 , preferably greater than about 2.00 X 10' 10 , and more preferably 
greater than about 4.00 X 10~ l ° 
5 Preferably, such mice additionally do not produce functional endogenous immunoglobulins. 

This is accomplished in a preferred embodiment through the inactivation (or knocking out) of 
endogenous heavy and light chain loci. For example, in a preferred embodiment, the mouse heavy 
chain J-regjon and mouse kappa light chain J-region and Q-region are inactivated through utilization 
of homologous recombination vectors that replace or delete the region. 

10 

Variable Region - B-cell Development 

B-cell development is reviewed in Klaus B Lymphocytes (IRL Press ( 1 990)) and Chapters 1 -3 
of Immunoglobulin Genes (Academic Press Ltd. (1989)), the disclosures of which are hereby 
incorporated by reference. Generally, in mammals, blood cell development, including B- and T-cell 

15 lymphocytes, originate from a common pluripotent stem cell. The lymphocytes, then, evolve from 
a common lymphoid progenitor cell. Following an early gestational period, B-cell initiation shifts 
from the liver to the bone marrow where it remains throughout the life of the mammal. 

In the life cycle of a B-cell, the first generally recognizable cell is a pro-pre-B-cell which is 
found in the bone marrow. Such a cell has begun heavy chain V-D-J rearrangement, but does not yet 

20 make protein. The cell then evolves into a large, rapidly dividing, pre-B-cell I which is a 
cytoplasmically \i* cell. This pre-B-cell I then stops dividing, shrinks, and undergoes light chain V-J 
rearrangement becoming a pre-B-cell II which expresses surface IgM, which leave the marrow as 
immature B-cells. Most of the emerging immature B-cells continue to develop and to produce 
surface IgD, indicative of their completion of differentiation and development as fully mature 

25 immunocompetent peripheral B-cells, which reside primarily in the spleen. However, it is possible 
to eliminate the delta constant region and still obtain immunocompetent cells. 

B-cell differentiation and development can be monitored and/or tracked through the use of 
surface markers. For example, the B220 antigen is expressed in relative abundance on mature B-cells 
in comparison to pre-B-cells I or II. Thus, cells that are B220* and surface IgM* (fi*) can be utilized 
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to determine the presence of mature B-cells. Additionally, cells can be screened for surface IgD 
expression (6*). Another antigen, heat stable antigen, is expressed by pre-B-cells II as they transition 
to the periphery (i.e., as they become u* and/or \i\ 5~). 



TABLE H 





Bone Marrow 


Spleen 


Marker 


pro-pre-B-cell 


pre-B-cell I 


pre-B-<wU II 
emerging B-cell 


immature B-cell 


mature B-cell 


B220 






± 


+ 


++ 


HSA 






+ 


± 




M 








+ 


+ 


6* 










+ 



Assuming the presence of a functional copy of the C6 gene on the transgene. 



Through use of B-cell markers, such as those mentioned above, development and 
differentiation of B-cells can be monitored and assessed. 

We have previously demonstrated that DI mice (mice that do not undergo heavy chain V-D-J 
rearrangement or light chain V-J rearrangement) do not produce mature B-cells. In fact, such mice 
arrest at the production of pro-pre-B-cells and B-cells never move from the bone marrow to 
peripheral tissues, including the spleen. Thus, both B-cell development and antibody production are 
completely arrested. The same result is seen in mice that are only heavy chain inactivated; B-cell 
development and differentiation arrests in the bone marrow. 

Our XenoMouse I strain produced functional, somewhat mature B-cells. However, the 
numbers of B-cells, in both the bone marrow and peripheral tissues, were significantly reduced 
relative to wild type mice. 

In contrast, our XenoMouse II strains and L6 strains, unexpectedly possess almost complete 
B-cell reconstitution. Therefore, in accordance with the invention, we have demonstrated that 
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through the quantitative inclusion or qualitative inclusion of variable region genes B-cell 
differentiation and development can be greatly reconstituted. Reconstitution of B-cell differentiation 
and development is indicative of immune system reconstitution. In general, B-cell reconstitution is 
compared to wild type controls. Thus, in preferred embodiments of the invention, populations of 
5 mice having inserted human variable regions possess greater than about 50% B-cell function when 
compared to populations of wild type mice. 

Further, it is interesting to note that production of human antibodies in preference to mouse 
antibodies is substantially elevated in mice having a knock-out background of endogenous Ig. That 
is to say that mice that contain a human Ig locus and a functionally inactivated endogenous heavy 
10 chain Ig locus produce human antibodies at a rate of approximately 100 to 1000 fold as efficiently 
as mice that only contain a human Ig locus and are not inactivated for the endogenous locus. 

Isotype Switching 

As is discussed in detail herein, as expected, XenoMouse II mice undergo efficient and 
1 5 effective isotype switching from the human transgene encoded mu isotype to the transgene encoded 
gamma-2 isotype. We have also developed XenoMouse II strains that contain and encode the human 
gamma-4 constant region. As mentioned above, mice in accordance with the invention can 
additionally be equipped with other human constant regions for the generation of additional isotypes. 
Such isotypes can include genes encoding y„ y 2 , y 3 , y 4 , a, e, 6, and other constant region encoding 
20 genes. Alternative constant regions can be included on the same transgene, i.e., downstream from 
the human mu constant region, or, alternatively, such other constant regions can be included on 
another chromosome. It will be appreciated that where such other constant regions are included on 
the same chromosome as the chromosome including the human mu constant region encoding 
transgene, cis-switching to the other isotype or isotypes can be accomplished. On the other hand, 
25 where such other constant region is included on a different chromosome from the chromosome 
containing the mu constant region encoding transgene, trans-switching to the other isotype or 
isotypes can be accomplished. Such arrangement allows tremendous flexibility in the design and 
construction of mice for the generation of antibodies to a wide array of antigens. 

It will be appreciated that constant regions have known switch and regulatory sequences that 
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they are associated with. All of the murine and human constant region genes had been sequenced 
and published by 1989 See Honjo et al. "Constant Region Genes of the Immunoglobulin Heavy 
Chain and the Molecular Mechanism of Class Switching" in Immunoglobulin Genes (Honjo et al. 
eds., Academic Press (1989)), the disclosure of which is hereby incorporated by reference. For 
example, in U.S. Patent Application Serial No. 07/574,748, the disclosure of which is hereby 
incorporated by reference, the cloning of the human gamma- 1 constant region was prophesized based 
on known sequence information from the prior art. It was set forth that in the unrearranged, 
unswitched gene, the entire switch region was included in a sequence beginning less than 5 kb from 
the 5' end of the first y-1 constant exon. Therefore the switch region was also included in the 5* 5.3 
kb Hindlll fragment that was disclosed in Ellison et al. Nucleic Acids Res, 10:4071-4079 (1982). 
Similarly, Takahashi et al. Cell 29:671-679 (1982) also reported that the fragment disclosed in Ellison 
contained the switch sequence, and this fragment together with the 7.7 kb Hindlll to BamHI fragment 
must include all of the sequences necessary for the heavy chain isotype switching transgene 
construction. 

Thus, it will be appreciated that any human constant region of choice can be readily 
incorporated into mice in accordance with the invention without undue experimentation. Such 
constant regions can be associated with their native switch sequences (i.e., a human Yi, 2, 3, or 4 constant 
region with a human Yi, 2.3, or 4 switch, respectively) or can be associated with other switch sequences 
(i.e., a human y 4 constant region with a human y 2 switch). Various 3* enhancer sequences can also 
be utilized, such as mouse, human, or rat, to name a few. Similarly other regulatory sequences can 
also be included. 

As an alternative to, and/or in addition to, isotype switching in vivo, B-cells can be screened 
for secretion of "chimeric" antibodies. For example, the L6 mice, in addition to producing fully 
human IgM antibodies, produce antibodies having fully human heavy chain V, D, J regions coupled 
to mouse constant regions, such as a variety of gammas (i.e., mouse IgGl, 2, 3, 4) and the like. Such 
antibodies are highly useful in their own right. For example, human constant regions can be included 
on the antibodies through in vitro isotype switching techniques well known in the art. Alternatively, 
and/or in addition, fragments (i.e., F(ab) and F(ab') 2 fragments) of such antibodies can be prepared 
which contain little or no mouse constant regions. 
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As discussed above, the most critical factor to antibody production is specificity to a desired 
antigen or epitope on an -itigen. Class of the antibody, thereafter, becomes important according to 
the therapeutic need. In other words, will the therapeutic index of an antibody be enhanced by 
providing a particular isotype or class? Consideration of that question raises issues of complement 
fixation and the like, which then drives the selection of the particular class or isotype of antibody. 
Gamma constant regions assist in affinity maturation of antibodies. However, the inclusion of a 
human gamma constant region on a transgene is not required to achieve such maturation. Rather, 
the process appears to proceed as well in connection with mouse gamma constant regions which are 
trans-switched onto the mu encoded transgene. 

Materials and Methods 
The following Materials and Methods were utilized in connection with the generation and 
characterization of mice in accordance with the present invention. Such Materials and Methods are 
meant to be illustrative and are not limiting to the present invention. 

Cloning Human Ig-derived YACs : The Washington University (Brownstein et al., 1989) 
and the CEPH (Abertsen et al., 1990) human- YAC libraries were screened for YACs containing 
sequences from the human heavy and kappa light chain loci as previously described (Mendez et al. 
1995). Cloning and characterization of 1H and IK YACs was described by Mendez et al., (1995). 
3H and 4H YACs were identified from the Washington University library using a V H 3 probe (0.55 
kbPstl/Ncol, Berman et al, 1988). The 17H YAC was cloned from the GM1416 YAC library and 
determined to contain 130 kb of heavy chain variable sequences and a 150 kb chimeric region at its 
3' end Matsuda et. al., 1993. 2K and 3K YACs were recovered from the CHEF library using VJI- 
specific primer (Albertsen et al., 1990). 

YAC targeting and recombination : Standard methods for yeast growth, mating, sporulation, 
and phenotype testing were employed (Sherman et al, 1986). Targeting of YAC s and YAC vector 
arms with yeast and mammalian selectable markers, to facilitate the screening of YAC recombinants 
in yeast of YAC integration into cells, was achieved by lithium acetate transformation (Scheistl and 
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Geitz (1989). After every targeting or recombination step the modified YAC(s) was analyzed by 
pulsed field gel eletrophoresis an H standard Southern Blots to determine the integrity of all sequences. 

YAC targeting vectors were used for the interconversion of centric and acentric arms to 
reorient 17H and to retrofit its 5' arm with LEU2 and URA3 genes and its 3' arm with the HIS3 gene. 
5 See Fig. la and Mendez et al, 1993. The 4H centric arm was retrofitted with the yeast ADE2 gene 
and the human HPRT selectable markers. For the first recombination step, a diploid yeast strain was 
created and selected in which all three YACs 17H, 3H, and 4H were present, intact, and stably 
maintained. A three-way homologous recombination between the YAC overlapping regions was 
induced by sporulation and the desired recombinant was found by the selection of the outer yeast 

1 0 selectable markers (ADE2 and HIS3) and negative selection (loss) of the internal marker URA3. The 
successful recombination created a 880 kb YAC containing 80% of the IgH variable region, starting 
at V H 2-5 and extending 20 kb 5* of the V H 3-65 gene. For the recombination of the 880 kb YAC to 
1H, 1H was retrofitted with pICL, which adds the LYS2 gene to the centric arm (Hermanson et al., 
1991). Using standard yeast mating, a diploid strain was selected containing both 1H and the 880 

15 kb YAC. Upon sporulation and by use of overlapping homology, YAC-yeast recombination was 
carried out. With positive selection for the outer yeast markers (ADE2 and URA3) and screening 
for the loss of the internal markers (TRP1, LYS2, HIS3), an intact 970 kb YAC consisting of 
approximately 66 V H segments, starting at V H 6-1 and ending at V H 3-65 was found. The YAC also 
contained the major D gene clusters, J H genes, the intronic enhancer (E\i\ Cfi, up to 25 kb past C6, 

20 in germline configuration. This 970 kb YAC was then retrofitted with a targeting vector including 
a 23 kb EcoRI genomic fragment of the human y-2 gene, including its switch and regulatory 
elements, a 7 kb Xbal fragment of the murine heavy chain 3* enhancer, neomycin gene driven by the 
metallothionine promoter (MMTNeo), and the yeast LYS2 gene. This vector, while bringing in these 
sequences on the 3' YAC arm, disrupts the URA3 gene. 

25 As a first step toward creating yK2 YAC, by standard yeast mating a diploid yeast strain was 

selected in which retrofitted IK and 3 K YACs were both present, intact, and stably maintained. 
Using the same process as described in connection with the IgH construction, YAC-yeast 
recombination was carried out. Through use of positive selection for the outer yeast markers (LYS2, 
TRP1) and the screening for the loss of internal markers (URA3, TRP1), an intact 800 kb 
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recombinant product was found which contained 32 V K starting at V K . B3 and ending at V K<>pU . The 
800 kb YAC contains a deletion of approximately 100 kb starting at and ending at V K . Lp . 5 . 

However, the YAC is in germline configuration from V K4-p . 13 to 100 kb past V,^.,. The YAC also 
contains J r , the intronic and 3' enhancers, the constant C Kt and Kde. 

YAC introduction into ES cells and mice: YAC-containing yeast spheroplasts were fused 
with E14.TG3B1 ES cells as described (Jakobovits et al., 1993a; Green et al., 1994). HAT-resistant 
colonies were expanded for analysis. YAC integrity was evaluated by Southern Blot analysis using 
protocols and probes described in Herman et al., (1988) and Mendez et al., (1994) and hybridization 
conditions as described in Gemmil et al., (1991). Chimeric mice were generated by microinjection 
of ES cells into C57BL/6 blastocysts. YAC-containing offspring were identified by PGR analysis of 
tail DNA as described (Green et al., 1994). YAC integrity was evaluated by Southern Blot analysis 
using probes and conditions previously described, except that the blot probed with human V H 3 was 
washed at 50°C. 

Flow cytometry analysis: Peripheral blood and spleen lymphocytes obtained from 8-10 week 
old XenoMice and control mice were purified on Lympholyte M (Accurate) and treated with purified 
anti-mouse CD32/CD16 Fc receptor (Pharmingen, 01 24 ID) to block non-specific binding to Fc 
receptors, stained with antibodies and analyzed on a FACStar^ 05 (Becton Dickinson, CELLQuest 
software). Antibodies used: allophycocyanin (APC) anti-B220 (Pharmingen, 01 129 A); biotin 
anti-human IgM (Pharmingen, 08072D); biotin anti-mouse IgM (Pharmingen, 02202D); fluoroscein 
isothiocyanate (FITC) goat F(ab') 2 anti-human IgD (Southern Biotechnology, 2032-02); FITC 
anti-mouse IgD a (Pharmingen, 05064D); FITC anti-mIgD b (Pharmingen, 05074D); FITC anti-mouse 
X (Pharmingen, 02174D); PE anti-human k (Pharmingen, 08 175 A); PE anti-mouse k (Pharmingen, 
02155A.) RED613™-streptavidin (GibcoBRL, 19541-010) was used to detect biotinylated 
antibodies. 

Immunization and hyhridoma generation: XenoMice (8 to 10 weeks old) were immunized 
intraperitoneal^ with 25 \xg of recombinant human IL-8 or with 5 \ig TNF-a (Biosource 
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International) emulsified in complete Freund's adjuvant for the primary immunization and in 
incomplete Freund's adjuvant for the additional immunizations carried out at two week intervals. For 
EGFR immunization, XenoMice were immunized intraperitoneally with 2xl0 7 A431 (ATCC 
CRL-7907) cells resuspended in phosphate buffered saline (PBS). This dose was repeated three 
5 times. Four days before fusion, the mice received a final injection of antigen or cells in PBS. Spleen 
and lymph node lymphocytes from immunized mice were fused with the non-secretory myeloma 
NSO-bcl2 line (Ray and Diamond, 1994), and were subjected to HAT selection as previously 
described (Galfre and Milstein, 1981). 

10 ELISA assay ; ELISA for determination of antigen-specific antibodies in mouse serum and 

in hybridoma supernatants were carried out as described (Coligan et al., 1994) using recombinant 
human DL-8 and TNF-a and affinity-purified EGFR from A43 1 cells (Sigma, E-3641 ) to capture the 
antibodies. The concentration of human and mouse immunoglobulins were determined using the 
following capture antibodies: rabbit anti-human IgG (Southern Biotechnology, 6145-01), goat 

15 anti-human IgK (Vector Laboratories, AI-3060), mouse anti-human IgM (CGI/ ATCC, HB-57), for 
human y, k, and \i Ig, respectively, and goat anti-mouse IgG (Caltag, M 30100), goat anti-mouse IgK 
(Southern Biotechnology, 1050-01), goat anti-mouse IgM (Southern Biotechnology, 1020-01), and 
goat anti-mouse A (Southern Biotechnology, 1060-01) to capture mouse y, and A. Ig, 

respectively. The detection antibodies used in ELISA experiments were goat anti-mouse IgG-HRP 

20 (Caltag, M-30107), goat anti-mouse IgK-HRP (Caltag, M 33007), mouse anti-human IgG2-HRP 
(Southern Biotechnology, 9070-05), mouse anti-human IgM-HRP (Southern Biotechnology, 
9020-05), and goat anti-human kappa-biotin (Vector, BA-3060). Standards used for quantitation of 
human and mouse Ig were: human IgG 2 (Calbiochem, 400122), human IgMK (Cappel, 13000), 
human IgG 2 K (Calbiochem, 400122), mouse IgGrc (Cappel 55939), mouse IgMK (Sigma, M-3795), 

25 and mouse IgG 3 X (Sigma, M-9019). 

Determinat ion of affinity constants of fully human Mahs by BIAcore : Affinity 
measurement of purified human monoclonal antibodies, Fab fragments, or hybridoma supernatants 
by plasmon resonance was carried out using the BIAcore 2000 instrument, using general procedures 
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outlined by the manufacturers. 

Kinetic analysis of the antibodies was carried out using antigens immobilized onto the sensor 
surface at a low density: human EL-8 -81 RU, soluble EGFR purified from A431 cell membranes 
(Sigma, E-3641)- 303 RU, and TNF-a- 107 RU (1,000 RU correspond to about 1 ng/mm 2 of 
immobilized protein). The dissociation (kd) and association (lea) rates were determined using the 
software provided by the manufacturers, BIAevaluation 2. 1. 

Affinity measurement by radioimmunoassay: ,25 I-labeled human IL-8 (1.5 x 10 n M or 3 
x 10" u M) was incubated with purified anti-IL-8 human antibodies at varying concentrations (5 x 10* 13 
M to 4 x 10- 9 M) in 200 \i\ of PBS with 0.5% BSA. After 15 hrs. incubation at room temperature, 
20 fil of Protein A Sepharose CL-4B in PBS (1/1, v/v) was added to precipitate the antibody-antigen 
complex. After 2 hrs. incubation at 4°C, the antibody- 125 I-IL-8 complex bound to Protein A 
Sepharose was separated from free 125 I-IL-8 by filtration using 96-well filtration plates (Millipore, 
Cat. No. MADVN65), collected into scintillation vials and counted. The concentration of bound and 
free antibodies was calculated and the binding affinity of the antibodies to the specific antigen was 
obtained using Scatchart analysis (2). 

Receptor binding assays: The IL-8 receptor binding assay was carried out with human 
neutrophils prepared either from freshly drawn blood or from buffy coats as described (Lusti- 
Marasimhan et al., 1995). Varying concentrations of antibodies were incubated with 0.23 nM 
[ 125 I]IL-8 (Amersham, IM-249) for 30 min at 4 t in 96-well Multiscreen filter plates (Millipore, 
MADV N6550) pretreated with PBS binding buffer containing 0.1% bovine serum albumin and 
0.02% NaN 3 at 25°C for 2 hours, 4 X 10 5 neutrophils were added to each well, and the plates were 
incubated for 90 min at 4°C. Cells were washed 5 times with 200 nl of ice-cold PBS, which was 
removed by aspiration. The filters were air-dried, added to scintillation fluid, and counted in a 
scintillation counter. The percentage of specifically bound [ 125 I]IL-8 was calculated as the mean cpm 
detected in the presence of antibody divided by cpm detected in the presence of buffer only. 

Binding assays for TNF receptor were performed in a similar manner as the EL-8 assays 
described above. However, the human monocyte line U937 was utilized instead of the neutrophil line 
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used in connection with the LL-8 assays. Antibodies were preincubated with 0.25 nM [ l25 ]TNF 
(Amersham, IM-206). 6 x 10 5 U937 cells were placed in each well. 

The EGF receptor binding assay was carried out with A431 cells (0.4 x 10 6 cells per well) 
which were incubated with varying concentrations of antibodies in PBS binding buffer for 30 minutes 
at 4°C. 0.1 nM [ I25 I]EGF (Amersham, IM-196) was added to each well, and the plates were 
incubated for 90 min at 4°C. The plates were washed five times, air-dried and counted in a 
scintillation counter. Anti-EGFR mouse antibodies 225 and 528 (Calbiochem) were used as controls. 

Repertoire analysis of human Ig transcripts expressed in XenoMice and their derived 
human Mabs: Poly(A) + mRNA was isolated from spleen and lymph nodes of unimmunized and 
immunized XenoMice using a Fast-Track kit (Invitrogen). The generation of random primed cDNA 
was followed by PCR. Human V H or human V K family specific variable region primers (Marks et al., 
1991) or a universal human V H primer, MG-30 (CAGGTGCAGCTGGAGCAGTCIGG) was used 
in conjunction with primers specific for the human C\i (hfiP2) or Ck (hicP2) constant regions as 
previously described (Green et al., 1994), or the human y2 constant region MG-40d; 
5-GCTGAGGGAGTAGAGTCCTG AGGA-3 ' . PCR products were cloned into pCRII using a TA 
cloning kit (Invitrogen) and both strands were sequenced using Prism dye-terminator sequencing kits 
and an ABI 377 sequencing machine. Sequences of human Mabs-derived heavy and kappa chain 
transcripts were obtained by direct sequencing of PCR products generated from poly(A^) RNA using 
the primers described above. All sequences were analyzed by alignments to the "V BASE sequence 
directory" (Tomlinson et al., MRC Centre for Protein Engineering, Cambridge, UK) using Mac Vector 
and Geneworks software programs. 

Preparation and purification o f antibody Fab fragments : Antibody Fab fragments were 
produced by using immobilized papain (Pierce). The Fab fragments were purified with a two step 
chromatographic scheme: HiTrap (Bio-Rad) Protein A column to capture Fc fragments and any 
undigested antibody, followed by elution of the Fab fragments retained in the flow-through on strong 
cation exchange column (PerSeptive Biosystems), with a linear salt gradient to 0.5 M NaCl. Fab 
fragments were characterized by SDS-PAGE and MALDI-TOF MS under reducing and non-reducing 
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conditions, demonstrating the expected -50 kD unreduced fragment and -25 kDa reduced doublet. 
This result demonstrates the intact light chain and the cleaved heavy chain. MS uMer reducing 
conditions permitted the unambiguous identification of both the light and cleaved heavy chains since 
the light chain mass can be precisely determined by reducing the whole undigested antibody. 

5 

EXAMPLES 

The following examples, including the experiments conducted and results achieved are 
provided for illustrative purposes only and are not to be construed as limiting upon the present 
invention. 

10 

Example 1: Reconstruction of human heavy chain loci on YACs 

In accordance with the present invention, the strategy that we utilized to reconstruct the 
human heavy chain and human kappa light chain variable regions was to, first, screen human- YAC 
libraries for YACs that spanned the large (megabase-sized) human Ig loci and, second, to recombine 
1 5 YACs spanning such regions into single YACs containing the desired loci predominantly in germline 
configuration. 

The above, stepwise, YAC recombination scheme exploited the high frequency of 
meiotic-induced homologous recombination in yeast and the ability to select the desired recombinants 
by the yeast markers present on the vector arms of the recombined YACs (See Figure 1, and Green 

20 et al M supra.\see also Silverman et al., 1990 and denDunnen et al., 1992). 

In connection with our strategy, we identified four YACs, 1H (240 kb), 2H (270 kb), 3H (300 
kb), and 4H (340 kb), which spanned about 830 kb, out of the about 1000 kb, of the human heavy 
chain variable region on chromosome 14q. YACs 1H, 2H, 3H, and 4H were used for reconstruction 
of the locus (See Figure 1 A). Pulsed Field Gel Electrophoresis (PFGE) and Southern blot analysis 

25 confirmed the YACs to be in intact, germline configuration, with the exception of 1 50 kb at the 3' end 
of YAC 2H which contained certain non-IgH sequences (See Figure 1; Matsuda et al., 1990). YAC 
1H, the YAC that was previously introduced into our first generation XenoMouse™ (Green et al., 
supra , Mendez et al., 1995), is comprised of the human C B , C p J„, and D H regions and the first 5 V H 
genes in germline configuration. The other three YACs cover the majority of the V H region, from 
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V H 2-5 to V H 3-65, thus contributing approximately an additional 61 different V H genes. Prior to 
recombination, YAC 4H "as retrofitted with an HPRT selectable marker. Through utilization of the 
overlapping sequences contained on the YACs, the four YACs (1H, 2H, 3H, and 4H) were 
recombined in yeast by a stepwise recombination strategy (See Figure 1 A). Such recombination 
strategy generated a 980 kb recombinant YAC (See Figure 1). Analysis of the YAC by PFGE and 
Southern blot analysis confirmed the presence of the human heavy chain locus from the C & region to 
20 kb 5* of the V H 3-65 gene in germline configuration. No apparent deletions or rearrangements were 
observed. 

The YAC acentric arm was targeted with a vector bearing the complete human y2 constant 
region, mouse 3' enhancer, and the neomycin resistance gene, to yield the final 1020 kb heavy chain 
YAC, yH2. YAC yH2 contained the majority of the human variable region i.e., 66 out of the 82 V H 
genes, complete D H (32 genes), and J H (6 genes) regions and three different constant regions (Cfi, 
C6, and Cy) with their corresponding regulatory sequences (See Figure 1 A). This was the heavy 
chain construct utilized for the production of our XenoMouse II strains. 

Example 2: Reconstruction of human kappa light chain loci on YACs 

A similar stepwise recombination strategy was utilized for reconstruction of the human kappa 
light chain locus. Three YACs were identified that spanned the human kappa loci. The YACs were 
designated IK, 2K and 3K. YAC IK, which had a length of approximately 1 80 kb, had previously 
been introduced into our first generation XenoMouse™. Such YAC contained the kappa deleting 
element, (Kde), the kappa 3' and intronic enhancers, C r , J K , and the three V K genes on the B cluster 
(Green et al., 1994; Mendez et al, 1995). YAC 2K (approximately 480 kb), and 3K (approximately 
380 kb) together encompass most of the kappa chain proximal variable region on chromosome 2p. 
A deletion of approximately 100 kb spans the L13-L5 region (Fig. IB; Huber et al., 1993). Inasmuch 
as the kappa distal region duplicates the proximal region, and as the proximal V K genes are the ones 
most commonly utilized humans (Weichold et al., 1993; Cox et al., 1994), the proximal region was 
the focus of our reconstruction strategy (Fig. IB). Through homologous recombination of the three 
YACS, an 800 kb recombinant YAC, yK2, was recovered. The size and integrity of the recombinant 
YAC was confirmed by PFGE and Southern blot analysis. Such analysis demonstrated that it covered 
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the proximal part of the human kappa chain locus, with 32 V K genes in germline configuration except 
for the described deletio * in the Lp region (Fig. IB). yK2 centric and acentric arms were mciified 
to contain the HPRT and neomycin selectable markers, respectively, as described (Materials and 
Methods). This was the kappa light chain construct utilized for the production of our XenoMouse 
II strains. 

The YACs described herein, yH2 and yK2, represent the first megabase-sized reconstructed 
human Ig loci to contain the majority of the human antibody repertoire, predominantly in germline 
configuration. This accomplishment further confirmed homologous recombination in yeast as a 
powerful approach for successful reconstruction of large, complex, and unstable loci. The selection 
of stable YAC recombinants containing large portions of the Ig loci in yeast provided us with the 
human Ig fragments required to equip the mice with the human antibody repertoire, constant regions, 
and regulatory elements needed to reproduce human antibody response in mice. 

Example 3: Introduction of vH2 and vK2 YACs into ES cells 

In accordance with our strategy, we introduced the YACs, yH2 and yK2, into mouse 
embryonic stem (ES) cells. Once ES cells containing the YAC DNA were isolated, such ES cells 
were utilized for the generation of mice through appropriate breeding. 

In this experiment, therefore, YACs yH2 and yK2, were introduced into ES cells via fusion 
of YAC-containing yeast spheroplasts with HPRT-deficient E14.TG3B1 mouse ES cells as previously 
described (Jakobovits et ah, 1993a; Green et al., 1994). HPRT-positive ES cell clones were selected 
at a frequency of 1 clone/ 1 5-20x1 0 6 fused cells and were analyzed for YAC integrity by Southern and 
CHEF blot analyses (Fig. 2A). 

Seven of thirty-five ES cell clones (referred to as LI 0, J9.2, LI 7, LI 8, J 17, L22, L23) derived 
from ES cell fusion with yH2-containing yeast were found to contain all expected EcoRI and BamM 
yH2 fragments detected by probes spanning the entire insert: mouse 3' enhancer, human intronic 
enhancer, human C y 2, Q, and C M constant regions, D H , J H and all the different V H families: V„l, V H 2, 
V H 3, Vh4, V h 5, and V„6 (data shown for 5 clones in Fig. 2 A). CHEF analysis further confirmed that 
these clones, which represent 20% of all clones analyzed, contain the entire intact yH2 YAC with no 
apparent deletions or rearrangements (data not shown). 
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ES cell clones derived from the fusion of yK2-containing yeast were similarly analyzed for 
YAC integrity, using probes specific for the human Kde, kappa 3' and intronic enhancers, C K , J . and 
all of the different V r families: VJ, V r II, VJII, V K IV, V K VL Twenty clones of the sixty clones had 
intact and unaltered YAC, which represent 30% of total clones analyzed (data shown for two ES 
clones ir. Fig. 3 A). Varying amounts of yeast genomic sequences were detected in yH2 and yK2-ES 
cell clones (data not shown). 

These results are the first demonstration of introduction of megabase-sized constructs 
encompassing reconstructed human loci, predominantly in germline configuration, into mammalian 
cells. The relatively high frequency of intact YACs integrated into the mouse genome further 
validated the ES cell-yeast spheroplast fusion methodology as an effective approach for faithful 
introduction of large human genomic fragments into ES cells. 

Example 4: Generation ofXenoMouse II strains 

In order to generate mice from the YAC DNA containing ES cells, microinjection of 
blastocysts was conducted, followed by breeding. Thus, yH2- and yK2-bearing ES cell clones were 
expanded and microinjected into mouse C57BL/6J blastocysts (Green et al., 1994) and the chimeric 
males produced were evaluated for germline transmission. Offspring with transmitted YAC were 
identified by PCR analysis and the YAC integrity was confirmed by Southern blot analysis. In all 
transgenic mice analyzed the YAC was shown to be in intact form (Fig.2B, 3B). All seven 
microinjected yH2-ES clones and two out of eight yK2-ES clones were transmitted through the 
mouse germline. 

In order to generate mice that produced human antibodies to the exclusion of endogenous 
antibodies, yH2- or yK2-transgenic mice were bred with double-inactivated (DI) mouse strains. The 
DI mouse strains are homozygous for gene targeted-inactivated mouse heavy and kappa chain loci 
and thus are deficient in antibody production (Jakobovits et al., 1993b; Green et al., 1994). Two of 
the yH2- transgenic mouse strains L10 and J9.2, and one of the yK2-transgenic mouse strains, J23.1, 
were bred with DI mice to generate mice bearing YACs on an homozygous inactivated mouse heavy 
and kappa chain background (yH2;DI, and yK2;DI). Each of the yH2;DI transgenic strains were bred 
with the yK2;DI transgenic strain to generate two XenoMouse II strains, 2A-1 (L10;J23.1;DI) and 
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2A-2 (J9.2;J23.1;DI), respectively, containing both heavy and light chain YACs on homozygous DI 
background. L10 is fully homozygous and J9.2 and J23. 1 are in the process of being successfully 
bred to homozygosity. 

The integrity of the human heavy and kappa chain YACs in XenoMouse II strains was 
confirmed by Southern blot analysis. As shown in Fig. 2 and Fig. 3, in both XenoMouse strains 
analyzed, yH2 and yK2 were transmitted unaltered through multiple generations with no apparent 
deletions or rearrangements. 

Example 5: B-ceH deve lopment and human antibody production bv XenoMouse II mice 

In order to further characterize the XenoMouse II strains, we studied their B-cell development 
and their production of human antibodies. Reconstitution of B-cell development and antibody 
production in XenoMouse II strains by yH2 and yK2 YACs was evaluated by flow cytometry and 
ELISA. In contrast to DI mice, which completely lack mature B-cells, XenoMouse II manifested 
essentially normal B-cell development with the mature B-cell population in the blood totaling over 
50% of the level seen in wild type mice (Fig. 4A). All B-cells were shown to express human IgM and 
high levels of B220 (human IgM*/B22tf li ) > with 60% of this population also expressing human IgD. 
Similar results were obtained from analysis of XenoMouse spleen and lymph nodes (not shown). 
These results correlate well with the characteristics of mature B-cells in wild type mice, indicating 
proper B-cell maturation in XenoMouse. 

The majority of XenoMouse B-cells (75-80%) expressed exclusively human kappa (K)Iight 
chain, whereas only about 15% expressed mouse lambda (X) light chain (Fig. 4). This light chain 
distribution ratio (hic/mA: 75:15) is comparable to that observed in wild type mice, indicating a 
mouse-like regulation of light chain utilization. In contrast, XenoMouse I, as described in Green et 
al., 1994, showed a ratio of hic/mA: 55:45 (data not shown). Similar observations were made for B- 
cells from spleen (Fig. 4B) and lymph nodes (not shown), indicating that most of XenoMouse IPs 
B-cells produced exclusively fully human antibodies. Levels of mX -expressing B-cells were reduced 
from 15% to 7% in XenoMouse II strains homozygous for yK2 (data not shown). 
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Example 6 Generation o/L6 Strain 

The L6 strain of mice were generated identically to the process described above in connection 
with the generation of the XenoMouse II strains. However, owing to a deletion event during the 
5 generation of the L6 ES cell line, the ES cell line, and, subsequently, the L6 mouse evolved without 
a portion of the sequence distal to CS, thus, eliminating the Cy constant region and its regulatory 
sequences. Following completion of breeding, the L6 mice will contain the entire yK2 construct and 
the entire yH2 construct, except for the missing Cy constant region. 

10 Example 7: Human Antibody Production 

Expression of human Cji, Cy2, and k light chains were detected in unimmunized XenoMouse 
II sera at maximal levels of 700, 600, and 800 ng/ml, respectively. To determine how these values 
compared to wild-type, we measured maximal levels of mouse Qi, Cy2, and k light chains in 
C57BL/6J x 129 mice kept under similar pathogen-free conditions. The values for Cn, Cy2, and k 

15 light chain in wild-type mice were 400, 2000, and 2000 ng/ml, respectively. Upon immunization, the 
human y chain levels increased to approximately 2.5 mg/ml. The concentration of mouse X was only 
70 ng/ml, further confirming the preferential use of human kappa chain. 

These findings confirmed the ability of the introduced human Ig YACs to induce proper Ig 
gene rearrangement and class switching and to generate significant levels of fully human IgM and IgG 

20 antibodies before and after immunization. 

Example 8: A diverse human antibody repertoire in XenoMouse II 

In order to further understand the reconstitution of the antibody repertoire in XenoMouse II 
strains, we challenged mice with several antigens, and prepared hybridoma cell lines secreting such 
25 antibodies. As will be understood, recapitulation of the human antibody response in mice requires 
diverse utilization of the different human variable genes contained on yH2 and yK2 YACs. The 
diversity of the human antibodies generated by XenoMouse II strains was determined by cloning and 
sequencing human heavy chain (\i and y) and kappa light chain transcripts from XenoMouse lymph 
nodes. Based upon our data to date, sequence analysis demonstrates that XenoMouse II utilizes at 
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least 1 1 out of the 37 functional V H genes present on yH2, eight different D H segments and three J H 
genes (J m , J H4 , J^) (Table III; J H5 was also detected in connection with our sequencing antibodies 
from hybridomas). V-D-J sequences were linked to human \x or y2 constant regions (not shown). 
The V H genes utilized are widely distributed over the entire variable region and represent four 
5 out of the seven V H families (Table III). The predominant utilization of V genes from and V H4 
families is similar to the V H usage pattern in adult humans, which is proportional to family size 
(Yamada et al. 1991; Brezinshek et al., 1995). The predominant usage of J H4 is also reminiscent of 
that detected in human B-cells (Brezinshek et al, 1995). Addition of non-germline nucleotides 
(N-additions) at both V-D and D-J joinings, ranging from 1-12 bp, were also observed. Such N- 
10 additions produced complementary determining regions 3 (CDR3s) with lengths of from 8 to about 
19 amino acid residues, which is very comparable to that observed in adults human B-cells (Yamada 
et al. 1991 ; Brezinshek et al., 1995). Such CDR3 lengths observed in the XenoMouse II are much 
longer than CDR3 lengths ordinarily observed in mice (Feeny, 1990). 

A highly diverse repertoire was also found in the ten kappa chain transcripts sequenced. In 
15 addition to displaying 8 out of the 25 Vk functional open reading frames (ORFs) present on yK2, all 
of the Jk genes were detectable (Table IV). The different Vk genes utilized were widely dispersed 
throughout yK2, representing all four major Vk gene families. All VkJk recombination products 
were linked properly to Ck sequences. The paucity of N-additions in our transcripts is in agreement 
with the greatly reduced terminal deoxynucleotide transferase activity at the stage of kappa chain 
. 20 rearrangement. The average CDR3 length of 9-1 0 amino acids that we observed in the kappa chain 
transcripts is identical to that observed in human B-cells (Marks et al., 1991). 

In Tables III and IV below, repertoire analyses of human heavy and kappa light chain 
transcripts expressed in XenoMouse II strains are presented. Human n, y, and k specific mRNAs 
were amplified by PCR, cloned and analyzed by sequencing as described in Materials and Methods. 
25 Table III shows a series of nucleotide sequences of 12 unique human heavy chain clones, divided into 
V ft D, J H and N segments, as identified by homology with published germline sequences (Materials 
and Methods). Each D segment assignment is based on at least 8 bases of homology. Table IV 
shows a series of nucleotide sequences of V-J junctions of 8 independent human k clones. The 
sequences are divided into V^, and N segments and identified based on homology to published V K 
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and J K sequences. In each of the Tables N-additions and deletions (indicated as _) were determined 
by their lack of sequence homology to V, D, or J sequences. 



TABLE m 

Repertoire Analysis of Human Heavy Chain Transcripts 



10 



15 



20 



Clone 


v„ 


N 


D M 


N 


>* 


A2.2.1 


5-51 (DP73) 
TTACTGTGCGAGACA 


4 

(TAGG) 


XP5rc 
AATCAT 


12 

(GGGAGCTACGGG) 


JH4 GACTACTGGGGC 


B2.1.5 


3-33 (DP-50) 
TTACTGTGCGAGAGA 


7 

(TCGGGGA) 


3rc 
AATAGCA 


7 

(CTGGCCT) 


JH4 CTTTGACTACTGGGGC 


B4.2.4 


3-15(DP-38) 
TTACTGTACCACAGA 


1 

(G) 


Kl 
GGCTAC 


11 

(ACTAACTACCC) 


JH6 CTACTACTACTACGGT 


B4.2.5 


4-59(DP-7I) 
TTACTGTGCGAGAGA 


10 

(TAGGAGTGTT) 


4 

GTAGTACCAGCTGCTAT 


6 

(ACCCAA) 


JH6 ACTACTACTACTACGGT 


D2.2.5 


4-34 (DP-63) 
TTACTGTGCGAGAG_ 


2 

(GG) 


Nl * 

GCAGCAGCTG 


4 

(CCCT) 


JH4 CTTTGACTACTGGGGC 


D2.1.3 


3-48 (DP-51) 
TTACTGTGCGAGAGA 


4 

(TCTT) 


XP1 

GATATTTTGACTGGT 


2 

(CT) 


JH6 CTACTACTACTACGGT 


D2.2.8 


4-31 (DP-65) 
TTACTGTGCGAGAGA 


2 

(GA) 


A4 

GACTGCAG 


5 

(CGGTT) 


JH4 TTTGACTACTGGGGC 


A2.2.4 


3-21 (DP-77) 
TTACTGTGCGAGAGA 


2 

(TV) 


IR3 
GGGGCTGG 


3 

(ACC) 


JH6 _T ACTACTACTACTACGGT 


D4.2.1I 


4-4/4.35 
ATTACTGTGCGA 


1 

(A) 


Nl 

TATAGCAGTGGCTGGT 


2 

(GT) 


JH4 CTTTGACTACTGGGGC 


Cl.2.1 


1-18(DP-14) 
TATTACTGTGCGAG_ 


0 


XPM/2I-7 
GTTA 


0 


JH4 GACTACTGGGGC 


C3.1.2 


4-39 (DP-79) 
TATTACTGTGCG 


3 

(GCC) 


2 

GGATATAGTAGTGG 


6 

(TCGGGC) 


JH4 CTTTGACTACTGGGGC 


D2.2.7 


5-51 (DP73) 
TTACTGTGCGAGACA 


4 

fTGGO 


Kl 

AGTGGCT 


9 

fGGTACTCTG) 


JH3 ATGCTTTGATATCTGGGG 
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TABLE TV 

Repertoire Analysis of Human Kappa Li ght Chain Transcripts 



Clone 


Vic 


N 


Jjc 


F2.2.3 


02(DPK9) 
TTAAACGAACAGTACCCC 


0 


Jk5 GATCACCTTCGGCCAA 


F4.1.8 


L5 (DPK5) 
ACAGGCTAACAGTTTCCCTC_ 


0 


JkI _GGACGTTCGGCCAA 


F4.1.6 


A20 (DPK4) 
AAGTATAACAGTGCCCC 


0 


Jpc3 ATTCACTTTCGGCCCT 


F2.2.5 


08 

ACAGTATGATAATCTCCC 


0 


Jk4 GCTC ACTTTCGGCGGA 


F2.1.3 


LI 

AAAGTATAATAGTTACCC 


0 


Jk5 GATCACCTTCGGCCAA 


F2.I.4 


A30 

CAGCATAATAGTTACCC 


0 


Jk3 ATTCACTTTCGGCCCT 


F2.1.3 


B3(DPK24) 
AATATTATAGTACTCC 


0 


Jic4 GCTCACTTTCGGCGGA 


F4.1.3 


A27(DPK22) 
CAGTATGGTAGCTCACCTC 


1 

(G) 


Jk2 CACTTTTGGCCAG 



These results, together with sequences of XenoMouse-derived hybridomas described later, 
demonstrate a highly diverse, adult human-like utilization of V, D, and J genes, which appears to 
demonstrate that the entire human heavy and kappa chain variable regions present on the yH2 and 
theyK2 YACs are accessible to the mouse system for antibody rearrangement and are being utilized 
in a non-position-biased manner. In addition, the average length of N-additions and CDRJs for both 
the heavy and kappa chain transcripts, is very similar to that seen in adult human B-cells, indicating 
that the YAC DNA contained in the mice direct the mouse machinery to produce an adult human-like 
immune repertoire in mice. 

In connection with the following Examples, we prepared high affinity antibodies to several 
antigens. In particular, antigens were prepared to human IL-8 and human EGFR. The rationale for 
the selection of IL-8 and EGFR is as follows. 

LL-8 is a member of the C-X-C chemokine family. IL-8 acts as the primary chemoattractant 
for neutrophils implicated in many diseases, including ARDS, rheumatoid arthritis, inflammatory 
bowel disease, glomerulonephritis, psoriasis, alcoholic hepatitis, reperfiision injury, to name a few. 
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Moreover, IL-8 is a potent angiogenic factor for endothelial cells. In Figures 22-28, we demonstrate 
that human anti-IL-8 ant" odies derived from XenoMouse II strains are effective in a inhibiting IL-8's 
actions in a number of pathways. For example, Figure 22 shows blockage of IL-8 binding to human 
neutrophils by human anti-IL-8. Figure 23 shows inhibition of CDllb expression on human 
neutrophils by human anti-IL-8. Figure 24 shows inhibition of EL-8 induced calcium influx by human 
anti-IL-8 antibodies. Figure 25 shows inhibition of DL-8 RB/293 chemotaxsis by human anti-DL-8 
antibodies. Figure 26 is a schematic diagram of a rabbit model of human IL-8 induced skin 
inflammation. Figure 27 shows the inhibition of human EL-8 induced skin inflammation in the rabbit 
model of Figure 26 with human anti-IL-8 antibodies. Figure 28 shows inhibition of angiogenesis of 
endothelial cells on a rat corneal pocket model by human anti-IL-8 antibodies. 

EGFR is viewed as an anti-cancer target. For example, EGFR is overexpressed, up to 100 
fold, on a variety of cancer cells. Ligand (EGF and TNF) mediated growth stimulation plays a critical 
role in the initiation and progression of certain tumors. In this regard, EGFR antibodies inhibit ligand 
binding and lead to the arrest of tumor cell growth, and, in conjunction with chemotherapeutic agents, 
induces apoptosis. Indeed, it has been demonstrated that a combination of EGFR Mabs resulted in 
tumor eradication in murine xenogeneic tumor models. Imcione has conducted Phase I clinical 
utilizing a chimeric Mab (C225) that proved to be safe. In Figures 31-33, we demonstrate data 
related to our human anti-EGFR antibodies. Figure 30 shows heavy chain amino acid sequences of 
human anti-EGFR antibodies derived from XenoMouse II strains. Figure 3 1 shows blockage EGF 
binding to A43 1 cells by human anti-EGFR antibodies. Figure 32 shows inhibition of EGF binding 
to SW948 cells by human anti-EGFR antibodies. Figure 33 shows that human anti-EGFR antibodies 
derived from XenoMouse II strains inhibit growth of SW948 cells in vitro. 

Example 9: Hieh affinity, antigen-spe cific human Mabs produced hv XenoMouse II 

We next asked whether the demonstrated utilization of the large human repertoire in 

XenoMouse II could be harnessed to generate human antibodies to multiple antigens, in particular, 

human antigens of significant clinical interest. 

Accordingly, individual XenoMouse II pups were challenged each with one of three different 

antigen targets, human IL-8, human EGFR and human TNF-a. Antigens were administered in two 
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different forms, either as soluble protein, in the case of IL-8 and TNF-oc or expressed on the surface 
of cells (A43 1 cells), in tU e case of EGFR. For all three antigens, ELISAs performed on sera from 
immunized mice indicated a strong antigen-specific human antibody (IgG, IgK) response with titers 
as high as 1:3x1 0 6 . Negligible mouse A response was detected. 
5 Hybridomas were derived from spleen or lymph node tissues by standard hybridoma 

technology and were screened for secretion of antigen-specific human Mabs by ELISA. 

An IL-8 immunized XenoMouse II yielded a panel of 12 hybridomas, all secreting fully human 
(hIgG 2 ic) Mabs specific to human IL-8. Antibodies from four of these hybridomas, Dl . 1 , K2.2, K4.2, 
and K4.3, were purified from ascitic fluid and evaluated for their affinity for human IL-8 and their 
10 potency in blocking binding of IL-8 to its receptors on human neutrophils. 

Affinity measurements were performed by solid phase measurements of both whole antibody 
and Fab fragments using surface plasmon resonance in BIAcore and in solution by radioimmunoassay 
(Materials and Methods). As shown in Table V, affinity values measured for the four Mabs ranged 
from l.lxlO 9 to 4.8xl0 10 M' 1 . While there was some variation in the techniques employed, affinity 
1 5 values for all four antibodies were consistently higher than 1 0 9 M* 1 . 

ELISA analysis confirmed that these four antibodies were specific to human IL-8 and did not 
cross-react with the closely related chemokines MlP-la, GROa, P, and y, ENA-78, MCP-1, or 
RANTES (data not shown). Further, competition analysis on the BIAcore indicated that the 
antibodies recognize at least two different epitopes (data not shown). All antibodies inhibit IL-8 
20 binding to human neutrophils as effectively as the murine anti-human IL-8 neutralizing antibody, 
whereas a control human IgG 2 K antibody did not (Fig. 5A). 

Fusion experiments with EGFR-immunized Xenomouse II yielded a panel of 25 hybridomas, 
all secreting EGFR-specific human IgG 2 K Mabs. Of the thirteen human Mabs analyzed, four (E2. 1, 
E2.4, E2.5, E2.1 1) were selected for their ability to compete with EGFR-specific mouse antibody 
25 225, which has previously been shown to inhibit EGF-mediated cell proliferation and tumor formation 
in mice (Sato et aL, 1983). These human antibodies, purified from ascitic fluid, were evaluated for 
their affinity for EGFR and neutralization of EGF binding to cells. The affinities of these antibodies 
for EGFR, as determined by BIAcore measurements, ranged from 2.9xl0 9 to 2.9xl0 10 M* 1 (Table V). 
All four anti-EGFR antibodies completely blocked EGF binding to A43 1 cells (Fig. 5B), 
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demonstrating their ability to neutralize its binding to both high and low affinity receptors on these 
cells (Kawamoto et al., 1983). Complete inhibition of EGF binding to EGFR expressed on h-man 
SW948 human lung carcinoma cells by all four anti-EGFR human antibodies was also observed (data 
not shown). In both cases, the fully human antibodies were as effective in inhibition of EGF binding 
as the ar.ti EGFR mouse antibody 225 and more potent than the 528 antibody (Gill et al., 1983). In 
both cell assays, a control human IgG 2 K antibody did not affect EGF binding (Fig. 5B and data not 
shown). 

Fusion experiments with TNF-a immunized Xenomouse II yielded a panel of 12 human IgG 2 K 
antibodies. Four out of the 12 were selected for their ability to block the binding of TNF-a to its 
receptors on U937 cells (Fig. 5C). The affinities of these antibodies were determined to be in the 
range of 1.2-3. 9xl0 9 M' 1 (Table V). 

The described Xenomouse-derived hybridomas produced antibodies at concentrations in the 
range of 2-19 fig/ml in static culture conditions. Characterization of the purified antibodies on protein 
gels under non-reducing conditions revealed the expected apparent molecular weight of 1 50 kD for 
the IgG 2 K antibody. Under reducing conditions the expected apparent molecular weights of 50 kD 
for the heavy and 25 kD for the light chain were detected (data not shown). 

Table V, below, shows affinity constants of XenoMouse-derived antigen-specific fully human 
Mabs. The affinity constants of XenoMouse-derived human IgG 2 K Mabs specific to IL-8, EGFR, and 
TNF-a were determined by BIAcore or by radioimmunoassay as described in Materials and Methods. 
The values shown for BL-8 and EGFR are representative of independent experiments carried out with 
purified antibodies, while the values shown for TNF-a are from experiments carried out with 
hybridoma supernatants. 
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TABLE V 



- Human 
Mab 
IgG>K 


Antigen 


ka (M'S 1 ) 


kd(S-») 


KA(M») 


KD(M) 


Surface 
Density 
fRUI* 


Radio 
Immunoassay 

V iVi ) 


lllilllll 


. 




Solid Phase Measurements 


Solution 


D1.I 


IL-8 


2.7 xlO 6 


9.9 x 10' 4 


2.7 x IO 9 


3.7 x IO* 10 


81 


2.0 x IO 10 


Dl.l Fab 


IL-8 


2.1 x 10 6 


2.1 x 10° 


1.1 xlO 9 


8.8 x IO" 10 


81 


4.9 x 10 n 


K2.2 


IL-8 


0.9 xlO 6 


2.3 x IO* 4 


4.0 xlO 9 


2.5 x IO' 10 


81 


1.0x10'° 


K4.2 


IL-8 


2.5 x 10 6 


4.1 xlO* 4 


6.3 x 10 9 


1.6 x IO" 10 


81 


ND 


K4.3 


IL-8 


4.3 x 10 6 


9.4 x IO' 4 


4.5 x 10 9 


2.2 x IO* 10 


81 


2.1 x 10" 


K4.3 Fab 


IL-8 


6.0 x 10 6 


2.1 x 10° 


2.9 x 10 9 


3.4 x IO 10 


81 


















ELISA (M) 


El.l 


EGFR 


1.9 x IO 6 


6.5 x 10' 4 


2.9 x 10 9 


3.46 x IO* 10 


303 


1.1 x IO 10 


E2.5 


EGFR 


2.1 x 10 6 


1.8 x 10' 4 


1.2 x IO 10 


8.44 x 10' u 


303 


3.6 x IO 10 


E2.ll 


EGFR 


1.7 x 10 6 


4.7 x IO" 4 


3.7 xlO 9 


2.68 x IO" 10 


303 


1.1 x IO' 10 


E2.4 


EGFR 


2.8 x 10 6 


9.78 x IO' 5 


2.9 x IO 10 


3.5x10" 


818 


I.I x IO 10 


















T22.I 


TNF-cc 


1.6 x 10 6 


1.3x10° 


1.2 xlO 9 


8.06 x IO* 10 


107 




T22.4 


TNF-a 


2.4 x !0 6 


4.6 x IO" 4 


5.3 x IO 9 


1.89 x IO* 10 


107 




T22.8 


TNF-a 


1.7 x 10 6 


7.5 x IO' 4 


2.3 x 10 9 


4.3 x 10"'° 


107 




T22.9 


TNF-a 


2.3 x 10 6 


4.9 x IO* 4 


4.8 x 10 9 


2.1 1 x IO 10 


107 




T22.il 


TNF-a 


2.9 x 10 6 


7.9 x IO" 4 


N/A 


2.76 xlO 10 


107 





Example 10: Gene usag e and somatic hvpermutation in monoclonal antibodies 

The sequences of the heavy and kappa light chain transcripts from the described IL-8 and 
EGFR-human Mabs were determined Figure 6 and Figures [[ ]]. The four IL-8-specific antibodies 
consisted of at least three different V„ genes (V H4 . 34 /V H4 „ 21 , V m _ 30 and V H5 . S1 ), four different D„ 
segments (A1/A4, Kl, ir3rc, and 21-10rc) and two J H (J^ and J H4 ) gene segments. Three different 
Vk genes (012, 01 8, and B3) combined with Jk3 and Jk4 genes. Such diverse utilization shows that 
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Xenomouse II is capable of producing a panel of anti-IL-8 neutralizing antibodies with diverse 
variable regions. 

In contrast to the IL-8 antibody transcripts, the sequences of antibodies selected for their 
ability to compete with Mab 225 showed relatively restricted V H and Vk gene usage, with three 
antibodies, ELI, E2.4 and E2.5 sharing the same V H gene (4-3 1 ) and E2. 1 1 containing V H4h6 „ which 
is highly homologous to V H4 . 31 . Different D (2, A1/A4, XP1) and J H (J H 3, J H 4, J H 5) segments were 
detected. All four antibodies were shown to share the same Vk (018) gene. Three of them 
contained Jk4, and one, E2.5, contained Jk2. 

Most V H and Vk hybridoma transcripts showed extensive nucleotide changes (7-17) from the 
corresponding germline segments, whereas no mutations were detected in the constant regions. Most 
of the mutations in V segments resulted in amino acid substitutions in the predicted antibody amino 
acid sequences (0-12 per V gene), many in CDRI and CDR2 regions (Figure J. Of note are the 
mutations which are shared by the heavy chain sequences of EGFR antibodies, such as the Gly- Asp 
substitution in CDRI, shared by all antibodies, or Ser-Asn substitution in CDR2 and Val-Leu in the 
framework region 3 shared by three antibodies. These results indicated that an extensive process of 
somatic hypermutation, leading to antibody maturation and selection, is occurring in Xenomouse II. 

Discussion 

This present application describes the first functional substitution of complex, megabase-sized 
mouse loci, with human DNA fragments equivalent in size and content reconstructed on YACs. With 
this approach, the mouse humoral immune system was "humanized" with megabase-sized human Ig 
loci to substantially reproduce the human antibody response in mice deficient in endogenous antibody 
production. 

Our success in faithful reconstruction of a large portion of the human heavy and kappa light 
chain loci, nearly in germline configuration, establishes YAC recombination in yeast as a powerful 
technology to reconstitute large, complex and unstable fragments, such as the Ig loci (Mendez et al., 
1995), and manipulate them for introduction into mammalian cells. Furthermore, the successful 
introduction of the two large heavy and kappa light chain segments into the mouse germline in intact 
form confirms the methodology of ES cell-yeast spheroplast fusion as a reliable and efficient approach 
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to delivering xenogeneic loci into the mouse germline. 

Characterization of Xenomouse II strains has shown that the large Ig loci were capable of 
restoring the antibody system, comparable in its diversity and functionality to that of wildtype mice, 
and much superior to the humoral response produced in mice bearing human Ig minigene constructs 
5 (Lonberg et al., 1994) or small human Ig YACs (Green et al., 1 994). This difference was manifested 
in the levels of mature B-cells, human Ig production, class switching efficiency, diversity, 
preponderance of human IgK over murine IgA production, and magnitude of the human antibody 
response, and success in the generation of high affinity, antigen-specific monoclonal antibodies to 
multiple antigens. 

10 The levels of mature B-cells and human antibodies in Xenomouse II are the highest yet 

reported for Ig-transgenic mice, representing a several-fold increase over the levels shown for 
previous mice and approaching those of wildtype mice. In particular, the levels of the human IgG 
were more than 100 fold higher than those reported for mice bearing minilocus Ig transgenes with 
human yl gene (Lonberg et al., 1994). The more efficient class switching in Xenomouse II was likely 

15 the result of the inclusion of the entire switch regions, with all of their regulatory elements, as well 
as the additional control elements on yH2, which may be important to support and maintain proper 
class switching. The elevated levels of mature B-cells in Xenomouse II strains are likely to result 
from the higher rearrangement frequency and thus improved B-cell development in the bone marrow 
due to the increased V gene repertoire. B-cell reconstitution is expected to be even more pronounced 

20 in XenoMouse II strains that are homozygous for the human heavy chain locus. 

The ratio of human k to mouse A light chain expression by circulating B-cells provides a 
useful internal measure of the utilization of the transgenic kappa chain locus. Whereas in mice 
containing one allele of smaller Ig YACs, an approximately equal distribution of human k and mouse 
k was observed, a significant preponderance of human k was detected in Xenomouse II strains. 

25 Moreover, in animals homozygous for yK2 possessed a k\X ratio that is identical to wild type mice. 
These observations together with the broad Vic gene usage strongly suggest that the human proximal 
Vk genes in the Xenomouse II are sufficient to support a diverse light chain response and are 
consistent with the bias toward proximal Vk gene usage in humans (Cox et al., 1994). 

Xenomouse II strains exhibited highly increased antibody diversity with V, D, and J genes 
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across the entire span of the loci accessed by the recombination mechanism and incorporated into 
mature antibodies. Once triggered by antigen binding, extensive somatic hypermutation occurs, 
leading to affinity maturation of the antibodies. 

The utilization pattern of V, D, J genes in Xenomouse II also indicated they are available and 
utilized in a manner reminiscent of their utilization in humans, yielding an adult-like human antibody 
repertoire, which is different from the fetal-like, position-biased usage observed in Ig 
minigene-bearing mice (Taylor et al., 1992; Taylor et al., 1994; Tuaillon et al., 1993). The broad 
utilization of many of the functional V H and Vk genes together with the multiplicity of antigens 
recognized by the mice underscores the importance of the large V gene repertoire to successfully 
reconstituting a functional antibody response. 

The ultimate test for the extent of reconstitution of the human immune response in mice is the 
spectrum of antigens to which the mice will elicit an antibody response and the ease with which 
antigen-specific high affinity Mabs can be generated to different antigens. Unlike mice engineered 
with smaller human Ig YACs or minigenes, which yielded to date only a limited number of 
antigen-specific human Mabs (Lonberg et al., 1994; Green et al., 1994; Fishwild et al., 1996), 
Xenomouse II generated Mabs to all human antigens tested to date. Xenomouse II strains mounted 
a strong human antibody response to different human antigens, presented either as soluble proteins 
or expressed on the surfaces of cells. Immunization with each of the three human antigens tested 
yielded a panel of 10-25 antigen-specific human IgG 2 K Mabs. For each antigen, a set of antibodies 
with affinities in the range of 10 9 -10 10 M" 1 was obtained. Several measures were taken to confirm that 
the affinity values represent univalent binding kinetics rather than avidity: BIAcore assays with intact 
antibodies were carried out with sensor chips coated at low antigen density to minimize the 
probability of bivalent binding; for two antibodies, the assay was repeated with monovalent Fab 
fragments; some of the antibodies were also tested by solution radioimmunoassay. From the results 
of these measurements, we conclude that antibodies with affinities in the range of 10 10 M" 1 are readily 
attainable with the XenoMouse. The affinity values obtained for XenoMouse-derived antibodies are 
the highest to be reported for human antibodies against human antigens produced from either 
engineered mice (Lonberg et al., Fishwild et al., 1996) or from combinatorial libraries (Vaughan et 
al., 1996). These high affinities combined with the extensive amino acid substitution as a result of 
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somatic mutation in the V genes confirms that the mechanism of affinity maturation is intact in 
Xenomouse II and comparable to that in wildtype mice. 

These results show that the large antibody repertoire on the human Ig YACs is being properly 
exploited by the mouse machinery for antibody diversification and selection, and, due to the lack of 
immunological tolerance to human proteins, can yield high affinity antibodies against any antigen of 
interest, including human antigens. The facility with which antibodies to human antigens can be 
generated by the human immunoglobulin genes in these mice provides further confirmation that self 
tolerance at the B-cell level is acquired and not inherited. 

The ability to generate high affinity fully human antibodies to human antigens has obvious 
practical implications. Fully human antibodies are expected to minimize the immunogenic and allergic 
responses intrinsic to mouse or mouse-derivatized Mabs and thus to increase the efficacy and safety 
of the administered antibodies. Xenomouse II offers the opportunity of providing a substantial 
advantage in the treatment of chronic and recurring human diseases, such as inflammation, 
autoimmunity, and cancer, which require repeated antibody administrations. The rapidity and 
reproducibility with which XenoMouse II yields a panel of fully human high affinity antibodies 
indicates the potential advance it offers over other technologies for human antibody production. For 
example, in contrast to phage display, which requires intensive efforts to enhance the affinity of many 
of its derived antibodies and yields single chain Fvs or Fabs, Xenomouse II antibodies are high affinity 
fully intact immunoglobulins which can be produced from hyliridomas without further engineering. 

The strategy described here for creation of an authentic human humoral immune system in 
mice can be applied towards humanization of other multi-gene loci, such as the T ceil receptor or the 
major histocompatibility complex, that govern other compartments of the mouse immune system 
(Jakobovits, 1994). Such mice would be valuable for elucidating the structure-function relationships 
of the human loci and their involvement in the evolution of the immune system. 

Incorporation By Reference 
All references cited herein, including patents, patent applications, papers, text books, and the 
like, and the references cited therein, to the extent that they are not already, are hereby incorporated 
herein by reference in their entirety. In addition, the following references are also incorporated by 
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SEQUENCE LISTING 



1 ) GENERAL INFORMATION 

(i) APPLICANT: Abgenix, Inc. 

(ii) TITLE OF THE INVENTION: TRANSGENIC MAMMALS HAVING HUMAN IG LOCI 
INCLUDING PLURAL VH AND VK ... 

(in) NUMBER OF SEQUENCES: 80 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Neave 

(B) STREET: 1251 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: NY 

(E) COUNTRY: USA 

(F) ZIP: 10020 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 03-DEC-l 997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/759,620 

(B) FILING DATE: 03-DEC- 1 996 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: James, Haley F 

(B) REGISTRATION NUMBER: 27,794 

(C) REFERENCE/DOCKET NUMBER: Cell 4.18 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212-596-9000 

(B) TELEFAX: 212-596-9090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 
CAGGTGCAGC TGGAGCAGTC GG 22 
(2) INFORMATION FOR SEQ ID NO:2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
GCTGAGGGAG TAGAGTCCTG AGGA 24 
(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
TTACTGTGCG AGACA 1 5 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
GGGAGCTACGGG 12 
(2) INFORMATION FOR SEQ ED NO:5: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
GACTACTGGGGC 12 
(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
TTACTGTGCG AGAG A 1 5 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 



(2) INFORMATION FOR SEQ ID NO:8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
TTACTGTACC ACAGA 15 
(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
ACTAACTACCC 11 

(2) INFORMATION FOR SEQ ED NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 



CTTTGACTAC TGGGGC 



16 
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(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



(2) INFORMATION FOR SEQ ED NO: 1 1 : 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ED NO: 1 1 : 
TTACTGTGCG AGAGA 15 
(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TAGGAGTGTT 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 



CTACTACTAC TACGGT 



16 



-58- 



WO 98/24893 





PCT/US97/23091 



(vi) ORIGINAL SOURCE: 

(xi) SEQUENCI DESCRIPTION: SEQ ID NO: 13: 



(2) INFORMATION FOR SEQ ID NO: 14: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ACTACTACTA CTACGGT 1 7 

(2) INFORMATION FOR SEQ ID NO: 1 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQIDNO:15: 
TTACTGTGCG AGAG 14 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



GTAGTACCAG CTGCTAT 



17 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



(2) INFORMATION FOR SEQ ID NO:17: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CTTTGACTAC TGGGGC 16 
(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTACTGTGCG AGAGA 1 5 

(2) INFORMATION FOR SEQ ID NO: 1 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



GCAGCAGCTG 



10 
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GATATTTTGA CTGGT 



15 



(2) INFORMATION FOR SEQ ID NO:20: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CTACTACTAC TACGGT 16 
(2) INFORMATION FOR SEQ ID NO:21 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21 : 
TTACTGTGCG AGAGA 1 5 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
TTTC ACT ACT GGGGC ] 5 
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(2) INFORMATION FOR SEQ ID NO:23: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
TTACTGTGCG AGAGA 1 5 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
TACTACTACT ACTACGGT 1 8 

(2) INFORMATION FOR SEQ ED NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
ATTACTGTGCGA 12 
(2) INFORMATION FOR SEQ ID NO:26: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE.NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
TATAGCAGTG GCTGGT 1 6 

(2) INFORMATION FOR SEQ ID NO:27: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
CTTTGACTAC TGGGGC 16 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
TATTACTGTG CGAG 14 

(2) INFORMATION FOR SEQ ED NO:29: 
(i) SEQUENCE CHARACTERISTICS: 



-63 - 



WO 98/24893 ^ ^ PCT/US97/23091 



(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(hi) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GACTACTGGGGC 12 
(2) INFORMATION FOR SEQ ID NO:30: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
TATTACTGTGCG 12 
(2) INFORMATION FOR SEQ ID NO:3 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3 1 : 
GGATATAGTA GTGG 14 

(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
CTTTGACTAC TGGGGC 16 
(2) INFORMATION FOR SEQ ID NO:33: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
TTACTGTGCG AGACA 15 
(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
ATGCTTTGAT ATCTGGGG 1 8 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
TTAAACGAAC AGTACCCC 1 8 

(2) INFORMATION FOR SEQ ID NO:36: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
GATCACCTTC GGCCAA 1 6 

(2) INFORMATION FOR SEQ ED NO;37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
ACAGGCTAAC AGTTTCCCTC 20 

(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY; linear 

(ii) MOLECU1 n TYPE: cDNA 

(iii) HYPOTHEflCAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
GGACGTTCGG CCAA 14 
(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
AAGTATAACA GTGCCCC 1 7 

(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
ATTC ACTTTC GGCCCT 1 6 

(2) INFORMATION FOR SEQ ID NO:4 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4 1 : 
ACAGTATGAT AATCTCCC 18 
(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
GCTCACTTTC GGCGGA 16 
(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
AAAGTATAAT AGTTACCC 1 8 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GATCACCTTC GGCCAA 16 
(2) INFORMATION FOR SEQ ID NO:45: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
CAGCATAATA GTTACCC 1 7 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
ATTCACTTTC GGCCCT 16 

(2) INFORMATION FOR SEQ ED NO:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 



-69- 





PCT/US97/23091 



WO 98/24893 



(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 



(2) INFORMATION FOR SEQ ID NO:48: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
GCTCACTTTC GGCGGA 1 6 

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
CAGTATGGTA GCTCACCTC 1 9 

(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 



AATATTATAG TACTCC 



16 
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(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 
CACTTTTGGCCAG 

(2) INFORMATION FOR SEQ ID NO:51 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 
Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 



Ser Thr Ser Thr 
20 

(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Leu Ser Leu Thr Cys Ala Val Tyr Gly Gly Ser Phe Ser Gly Tyr Tyr 
1 5 10 15 

Trp Ser Trp lie Arg Gin Pro Pro Gly Lys Gly Leu Glu Tip He Gly 

20 25 30 

Glu He Asn His Ser Gly Ser Thr Asn Tyr Asn Pro Ser Leu Lys Ser 

35 40 45 

Arg Val Thr He Ser Val Asp Thr Ser Lys Asn Gin Phe Ser Leu Lys 

50 55 60 

Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg 
65 70 75 80 



5 



10 



15 
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(2) INFORMATION FOR SEQ ID NO:53: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 1 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDED NESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

Leu Ser Leu Thr Cys Ala Val Tvr Gly Gly Ser Phe Ser Gly Tyr Tyr 
1 5 10 15 

Trp Ser Trp lie Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp lie Gly 

20 25 30 

Glu He Asn Gin Ser Gly Ser Thr Asn Tvr Asn Pro Ser Leu Lys Ser 

35 40 45 

Arg Val He He Ser He Asp Thr Ser Lvs Thr Gin Phe Ser Leu Lys 

50 55 60 

Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg 
65 70 75 80 

Glu Thr Pro His Ala Phe Asp lie Trp Gly Gin Gly Thr Met Val Thr 

85 90 95 

Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro 

100 105 110 

Cys Ser Arg Ser Thr Ser Thr 
115 

(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 

Leu Ser Leu Thr Cys Ala Val Tyr Gly Gly Ser Phe Ser Gly Tyr Tyr 
1 5 10 15 

Trp Thr Trp He Arg Gin Pro Pro Gly Lvs Gly Leu Glu Trp He Gly 

20 25 30 

Glu He He His His Gly Asn Thr Asn Tyr Asn Pro Ser Leu Lys Ser 
35 40 45 
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Arg Val Ser lie Ser Val Asp Thr Ser Lys Asn Gin Phe Ser Leu Thr 

50 55 60 

Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg 
65 70 75 80 

Gly Gly Ala Val Ala Ala Phe Asp Tyr Trp Gly Gin Gly Thr Leu Val 

85 90 95 

Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala 

100 105 110 

Pro Cys Ser Arg Ser Thr Ser Thr 
115 120 

(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

Ser His His Leu Lys He Ser Cys Lys Gly Ser Gly Tyr Ser Phe Thr 
1 5 10 15 

Ser Tyr Trp lie Gly Trp Val Arg Gin Met Pro Gly Lys Gly Leu Glu 

20 25 30 

Trp Met Gly He He Tyr Pro Gly Asp Ser Asp Thr Arg Tyr Ser Pro 

35 40 45 

Ser Phe Gin Gly Gin Val Thr lie Ser Ala Asp Lys Ser lie Ser Thr 

50 55 60 

Ala Tyr Leu Gin Trp Ser Ser Leu Lys Ala Ser Asp Thr Ala Met Tyr 
65 70 75 80 

Tyr Cys Ala Arg 



(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 
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Ser Leu Lys He Ser Cys Lys Glv Ser Gly Tyr Ser Phe Thr Ser Tyr 
1 5 10 15 

Trp lie Gly Trp Va. j-g Gin Met Pro Gly Lys Gly Leu Glu Trp Met 

20 25 30 

Gly He He Tyr Pro Gly Asp Ser Asp Thr Arg Tyr Ser Pro Ser Phe 

35 40 45 

Gin Gly Gin Val Thr He Ser Ala Asp Lys Ser He Ser Thr Ala Tyr 

50 55 60 

Leu Gin Trp Ser Ser Leu Lys Ala Ser Asp Thr Ala Met Tyr Tyr Cys 
65 70 75 80 

Ala Arg Gin Asp Gly Asp Ser Phe Asp Tyr Trp Gly Gin Gly Thr Leu 

85 90 95 

Val Thr Val Ser Ser AJa Ser Thr Lys Gly Pro Ser Val Phe Pro Leu 

100 105 310 

Ala Pro Cys Ser Arg Ser Thr Ser Thr 
115 120 

(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

Arg Ser Leu Arg Leu Ser Cvs Ala Ala Ser Glv Phe Thr Phe Ser Ser 
I 5 10 15 

Tyr Gly Met His Trp Xaa Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp 

20 25 30 

Val AJa Val lie Ser Tyr Asp Glv Ser Asn Lys Tyr Tvr Ala Asp Ser 

35 40 45 

Val Lys Gly Arg Phe Thr He Ser Arg Asp Asn Ser Lvs Asn Thr Leu 

50 55 60 

Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr 
65 70 75 80 

Cys Ala Arg 



(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Arg Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Ser 
15 10 15 

Tyr Gly Met His Tip Xaa Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp 

20 25 30 

Val Ala Glu He Ser Tyr Asp Gly Ser Asn Lys Tyr Tyr Val Asp Ser 

35 40 45 

Val Lys Gly Arg Leu Thr He Ser Arg Asp Asn Ser Lys Asn Thr Leu 

50 55 60 

Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr 
65 70 75 80 

Cys Ala Arg Asp Arg Leu Gly He Phe Asp Tyr Trp Gly Gin Gly Thr 

85 90 95 

Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro 

100 105 110 

Leu Ala Pro Cys Ser Arg Ser Thr Ser Thr 
115 120 

(2) INFORMATION FOR SEQ ID NO:59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 arnino acids 

(B) TYPE: arnino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



Gin 
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(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

Thr He Thr Cys Gin Ala Ser Gin Asp He Ser Asn Tyr Leu Asn Tip 
1 5 10 15 

Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Asp Ala 

20 25 30 

Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser 

35 40 45 

Gly Thr Asp Phe Thr Phe Thr He Ser Ser Leu Gin Pro Glu Asp He 

50 55 60 

Ala Thr Tvr Tyr Cys Gin Gin Asp Asn Leu Pro 
65 70 75 

(2) INFORMATION FOR SEQ ID NO:6 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 04 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 

Thr He Thr Cys Gin Ala Ser Gin Asp He Ser Lys Phe Leu Ser Trp 
I 5 10 15 

Phe Gin Gin Lys Pro Glv Lys Ala Pro Lvs Leu Leu He Tyr Gly Thr 

20 25 30 

Ser Tyr Leu Glu Thr Gly Val Pro Ser Ser Phe Ser Gly Ser Gly Ser 

35 40 45 

Gly Thr Asp Phe Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp Val 

50 55 60 

Ala Thr Tyr Phe Cys Gin Gin Asp Asp Leu Pro Tyr Thr Phe Gly Pro 
65 70 75 80 

Gly Thr Lys Val Asp He Lys Arg Thr Val Ala Ala Pro Ser Val Phe 

85 90 95 

lie Phe Pro Pro Ser Asp Glu Gin 
100 

(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Thr He Thr Cys Gin Ala Ser Gin Asp He Ser Asn Tyr Leu Asn Trp 
1 5 10 15 

Tyr Gin Gin Lvs Ala Gly Lys Ala Pro Lys Val Leu He Tyr Ala Ala 

20 25 30 

Ser Asn Leu Glu Ala Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser 

35 40 45 

Gly Thr Asp Phe Thr Phe Thr He Ser Ser Leu Gin Pro Glu Asp He 

50 55 60 

Ala Thr Tyr Tyr Cys His Gin Asp Asn Leu Pro Leu Thr Phe Gly Gly 
65 70 75 80 

Gly Thr Lys Val Glu lie Lys Arg Thr Val Ala Ala Pro Ser Val Phe 

85 90 95 

He Phe Pro Pro Ser Asp Glu Gin 



(2) INFORMATION FOR SEQ ID NO:63: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

Thr Cys Arg Ala Ser Gin Ser He Ser Ser Tyr Leu Asn Trp Tyr Gin 
15 10 15 

Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr Ala Ala Ser Ser 

20 25 30 

Leu Gin Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr 

35 40 45 

Asp Phe Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr 

50 55 60 

Tyr Tyr Cys Gin Gin Ser Tyr Ser Thr Pro 
65 70 

(2) INFORMATION FOR SEQ ID NO:64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 



100 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

Thr Cys Arg Ala Ser Gin Ser He Ser Asn Tyr Leu Asn Trp Tyr Gin 
15 10 15 

Gin Lys Pro Gly Lys Ala Pro Lys Phe Leu He Tyr Gly Ala Ser Ser 

20 25 30 

Leu Glu Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr 

35 40 45 

Asp Phe Thr Leu Thr lie Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr 

50 55 60 

Tyr Tyr Cys Gin Gin Ser Tyr Ser Asn Pro Leu Thr Phe Gly Gly Gly 
65 70 75 80 

Thr Lys Val Glu lie Lys 
85 

(2) INFORMATION FOR SEQ ID NO:65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

Thr lie Asn Cys Lys Ser Ser Gin Ser Val Leu Tyr Ser Ser Asn Asn 
15 10 15 

Lys Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lys 

20 25 ' 30 

Leu Leu He Tyr Trp Ala Ser Thr Arg Glu Ser Gly Val Pro Asp Arg 

35 40 45 

Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser 

50 55 60 

Leu Gin Ala Glu Asp Val Ala Val Tvr Tyr Cys Gin Gin Tyr Tyr Ser 
65 70 75 80 

Thr Pro 



(2) INFORMATION FOR SEQ ID NO:66: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

Thr lie Asn Cys Lys Ser Ser Gin Ser Val Leu Tyr lie Ser Asn Asn 
1 5 10 15 

Lys Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ser Pro Lys 

20 25 30 

Leu Leu He Tyr Trp Ala Ser Thr Arg Lys Ser Gly Val Pro Asp Arg 

35 40 45 

Phe Ser Glv Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser 

50 55 60 

Leu Gin Ala Glu Asp Val Ala Val Tyr Tyr Cys Gin Gin Tyr Tyr Asp 
65 70 75 80 

Thr Pro Phe Thr Phe Gly Pro Glv Thr Lys Val Asp lie Lys 
85 90 

(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 
1 5 10 15 

Ser Thr Ser Thr 
20 

(2) INFORMATION FOR SEQ ID NO:68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 

Val Ser Glv Gly Ser He Ser Ser Gly Gly Tyr Tyr Trp Ser Trp He 
1 5 10 15 

Arg Gin His Pro Gly Lys Gly Leu Glu Trp He Gly Tyr He Tvr Tyr 

20 25 30 

Ser Gly Ser Thr Tyr Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr He 

35 40 45 

Ser Val Asp Thr Ser Lys Asn Gin Phe Ser Leu Lys Leu Ser Ser Val 

50 55 60 

Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg 
65 70 75 

(2) INFORMATION FOR SEQ ID NO:69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

Val Ser Glv Glv Ser He Asn Ser Glv Asp Tyr Tvr Trp Ser Trp He 
1 5 10 15 

Arg Gin His Pro Gly Lys Gly Leu Asp Cys He Gly Tyr lie Tyr Tyr 

20 25 30 

Ser Gly Ser Thr Tyr Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr lie 

35 40 45 

Ser Val Asp Thr Ser Lys Asn Gin Phe Phe Leu Lys Leu Thr Ser Val 

50 55 60 

Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg Ser Thr Val Val 
65 70 75 80 

Asn Pro Gly Trp Phe Asp Pro Trp Glv Gin Gly Tvr Leu Val Thr Val 

85 90 95 

Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys 

100 105 110 

Ser Arg Ser Thr Ser Thr 
115 

(2) INFORMATION FOR SEQ ID NO:70: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 

Val Ser Gly Gly Ser He Asn Ser Gly Asp Tyr Tyr Trp Ser Tip He 
15 10 15 

Arg Gin His Pro Gly Lys Gly Leu Glu Trp He Gly Ser He Tyr Tyr 

20 25 30 

Ser Gly Asn Thr Phe Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr He 

35 40 45 

Ser Leu Asp Thr Ser Lys Asn Gin Phe Ser Leu Lys Leu Ser Ser Val 

50 55 60 

Thr Ala Ala Asp Thr Ala Val Cys Tyr Cys Ala Arg Asn lie Val Thr 
65 70 75 * 80 

Thr Gly Ala Phe Asp He Trp Gly Gin Gly Thr Met Val Thr Val Ser 

85 90 95 

Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser 

100 105 110 

Arg Ser Thr Ser Thr 
115 

(2) INFORMATION FOR SEQ ID NO:71 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7l: 

Val Ser Gly Gly Ser He Ser Ser Gly Asp Tyr Tyr Trp Thr Trp He 
15 10 15 

Arg Gin His Pro Gly Lys Gly Leu Glu Trp He Gly Tyr He Tyr Tyr 

20 25 30 

Ser Gly Asn Thr Tyr Tyr Asn Pro Ser Leu Lys Ser Arg Val Ser Met 

35 40 45 

Ser He Asp Thr Ser Glu Asn Gin Phe Ser Leu Lys Leu Ser Ser Val 

50 55 60 

Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg Lys Pro Val Thr 
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65 70 75 80 

Gly Gly Glu Asp Tyr Tip Gly Gin Gly Thr Leu Val Thr Val Ser Ser 

85 90 95 

Ala Ser Thr Lys Gly fro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 

100 105 110 

Ser Thr Ser Thr 
115 

(2) INFORMATION FOR SEQ ID N0.72; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

Val Ser Gly Gly Ser Val Ser Ser Gly Ser Tyr Tvr Trp Ser Trp He 
15 10 15 

Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp He Gly Tyr He Tyr Tyr 

20 25 30 

Ser Gly Ser Thr Asn Tyr Asn Pro Scr Leu Lvs Ser Arg Val Thr lie 

35 40 45 

Ser Val Asp Thr Ser Lys Asn Gin Phe Ser Leu Lys Leu Ser Ser Val 

50 55 60 

Thr Ala Ala Asp Thr Ala Val Tvr Tvr Cvs Ala Arg 
65 70 75 " 

(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

Val Ser Gly Gly Ser Val Ser Ser Gly Ser Tyr Tyr Trp Ser Trp He 
1 5 10 15 

Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp He Gly Tyr He Tyr Tyr 
20 25 30 
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Ser Gly Ser Thr Asn Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr He 



Ser Val Asp Thr Sr* T .ys Asn Gin Phe Ser Leu Lys Leu Ser Ser Val 



(2) INFORMATION FOR SEQ ID NO:74: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 1 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 

Val Ser Glv Gly Ser Val Ser Ser Gly Asp Tyr Tyr Trp Ser Trp He 
1 5 10 15 

Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp He Gly His Leu Tyr Tyr 

20 25 30 

Ser Gly Asn Thr Asn Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr He 

35 40 " 45 

Ser Leu Asp Thr Ser Lys Asn Gin Phe Ser Leu Lys Leu Ser Ser Val 

50 55 60 

Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg Asp Phe Leu Thr 
65 70 75 80 

Gly Ser Phe Phe Asp Tvr Trp Glv Gin Glv Thr Leu Val Thr Val Ser 

85 90 95 

Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser 

100 105 110 

Arg Ser Thr Ser Thr 
115 

(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 



35 



40 



45 



50 55 60 

Thr Ala Ala Asp thr Ala Val Tyr Tyr Cvs Ala Arg 
65 70 75 
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Arg Thr Val Ala Ala Pro Scr Val Phe He Phe Pro Pro Ser Asp Glu 



(2) INFORMATION FOR SEQ ID NO:76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0.76: 

Thr He Thr Cys Gin Ala Scr Gin Asp He Ser Asn Tyr Leu Asn Trp 
1 5 10 15 

Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Asp Ala 

20 25 30 

Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser Glv Ser Gly Ser 

35 40 45 

Gly Thr Asp Phe Thr Phe Thr He Ser Ser Leu Gin Pro Glu Asp He 

50 55 60 

Ala Thr Tyr Tyr Cys Gin Gin Asp Asn Leu Pro 
65 70 75 

(2) INFORMATION FOR SEQ ID N0:77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

Thr He Thr Cys Gin Ala Ser Gin Asp He Asn Asn Tyr Leu Asn Trp 
1 5 10 15 

Phe Gin Gin Lys Pro Gly Lys Ala Pro Lys Val Leu He His Asp Ala 

20 25 30 

Ser Asn Leu Glu Thr Gly Gly Pro Ser Arg Phe Ser Gly Ser Gly Ser 

35 40 45 

Gly Thr Asp Phe Thr Phe Thr He Ser Gly Leu Gin Pro Glu Asp He 



5 



10 



15 



Gin 
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50 55 60 

Ala Thr Tyr Tyr Cys Gin Gin Glu Ser Leu Pro Leu Thr Phe Gly Gly 
65 70 75 80 

Gly Thr Lys Val Glu He Lys Arg Thr Val Ala Ala Pro Ser Val Phe 

85 90 95 

lie Phe Pro Pro Ser Asp Glu Gin 
100 

(2) INFORMATION FOR SEQ ID NO:78: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 04 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

Thr He Thr Cys Gin Ala Ser Gin Asp He Thr He Tvr Leu Asn Trp 
15 10 15 

Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Asn Asp Ala 

20 25 30 

Ser Ser Leu Glu Thr Gly Val Pro Leu Arg Phe Ser Glv Ser Gly Ser 

35 40 45 

Gly Thr Asp Phe Thr Phe Thr He Ser Ser Leu Gin Pro Glu Asp lie 

50 55 60 

Ala Thr Tyr Tyr Cys Gin Gin Asp His Leu Pro Leu Thr Phe Gly Gly 
65 70 75 80 

Gly Thr Lvs Val Ala He Lvs Arg Thr Val Ala Ala Pro Ser Val Phe 

85 90 95 

He Phe Pro Pro Ser Asp Giu Gin 
100 

(2) INFORMATION FOR SEQ ID NO:79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 
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Thr He Thr Cys Gin Ala Ser Gin Asp lie Ser Asn Tyr Leu Asn Trp 
1 5 10 15 

Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr Asp Ala 

20 25 30 

Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser 

35 40 45 

Gly Thr Asp Phe Thr Phe Thr He Ser Ser Leu Gin Pro Glu Asp Val 

50 55 60 

Gly Thr Tyr Tyr Val Gin Gin Glu Ser Leu Pro Cvs Gly Phe Gly Gin 
65 70 75 80 

Gly Thr Lys Leu Glu He Lys Arg Thr Val Ala Ala Pro Ser Val Phe 

85 90 95 

He Phe Pro Pro Ser Asp Glu Gin 
100 

(2) INFORMATION FOR SEQ ID NO:80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(hi) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 

Thr lie Thr Cys Gin Ala Ser Gin Asp lie Ser Asn Tyr Leu Asn Trp 
15 10 15 

Tyr Gin Gin Lvs Pro Gly Lys Ala Pro Lvs Leu Leu He Asn Asp Ala 

20 25 30 ' 

Ser Asp Leu Glu Thr Gly Val Pro Ser Arg He Ser Glv Ser Gly Ser 

35 40 45 

Gly Thr Asp Phe Thr Phe Thr lie Ser Asn Leu Gin Pro Glu Asp He 

50 55 60 

Ala Thr Tyr Tyr Cys Gin Gin Asp Ser Leu Pro Leu Thr Phe Gly Gly 
65 70 75 80 

Gly Thr Lys Val Glu lie Arg Arg Thr Val Ala Ala Pro Ser Val Phe 

85 90 95 

lie Phe Pro Pro Ser Asp Glu Gin 
100 
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Claims 

WHAT is Claimed is; 

1. A transgenic non-human mammal having a genome, the genome comprising 
modifications, the modifications comprising: 

an inactivated endogenous immunoglobulin (Ig) locus, such that the mammal would 
not display normal B-cell development; 

an inserted human heavy chain Ig locus in substantially germline configuration, the 
human heavy chain Ig locus comprising a human mu constant region and regulatory and 
switch sequences thereto, a plurality of human J H genes, a plurality of human D H genes, and 
a plurality of human V H genes; and 

an inserted human kappa light chain Ig locus in substantially germline configuration, 
the human kappa light chain Ig locus comprising a human kappa constant region, a plurality 
of Jk genes, and a plurality of Vk genes, 

wherein the number of V H and Vk genes inserted are selected to substantially restore normal 
B-cell development in the mammal. 

2. The mammal of Claim 1, wherein the heavy chain Ig locus comprises a second 
constant region selected from the group consisting of human gamma- 1, human gamma-2, human 
gamma-3, human gamma-4, alpha, epsilon, and delta. 

3 . The mammal of Claim 1 , wherein the number of V H genes is greater than about 20. 

4. The mammal of Claim 1, wherein the number of Vk genes is greater than about 15. 

5. The mammal of Claim 1, wherein the number of D H genes is greater than about 25, 
the number of J H genes is greater than about 4, the number of V H genes is greater than about 20, the 
number of Jk genes is greater than about 4, and the number of Vk genes is greater than about 15. 

6. The mammal of Claim 1, wherein the number of D H genes, the number of J H genes, 
the number of V H genes, the number of Jk genes, and the number of Vk genes are selected such that 
the Ig loci are capable of encoding greater than about 1 x 10 5 different functional antibody sequence 
combinations. 

7. The mammal of Claim 1, wherein in a population of mammals B-cell function is 
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reconstituted on average to greater than about 50% as compared to wild type. 

8. In a transgenic non-human mammal having a genome that comprises modifications, 
the modifications rendering the mammal capable of producing human immunoglobulin molecules but 
substantially incapable of producing functional endogenous antibody molecules, the improvement 
comprising: 

insertion into the genome of the mammal of sufficient human V H , D H , J H , Vic, and Jk 
genes such that the mammal is capable encoding greater than about 1 x 10 6 different 
functional human immunoglobulin sequence combinations, without accounting for junctional 
diversity or somatic mutation events. 

9. In a transgenic non-human mammal having a genome that comprises modifications, 
the modifications rendering the mammal capable of producing human immunoglobulin molecules but 
substantially incapable of producing functional endogenous antibody molecules, which modifications, 
with respect to the mammal's incapacity to produce functional endogenous antibody molecules would 
not allow the mammal to display normal B-cell development, the improvement comprising: 

insertion into the genome of the mammal of sufficient human V H , D H , J H , Vk, and Jk 
genes such that the mammal is capable of encoding greater than about 1 x 10 6 different 
functional human immunoglobulin sequence combinations and sufficient V H and Vk genes to 
substantially restore normal B-cell development in the mammal. 

10. In the mammal of Claim 9, wherein in a population of mammals B-cell function is 
reconstituted on average to greater than about 50% as compared to wild type. 

.11. A transgenic non-human mammal having a genome, the genome comprising 
modifications, the modifications comprising: 

an inactivated endogenous heavy chain immunoglobulin (Ig) locus; 
an inactivated endogenous kappa light chain Ig locus; 

an inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising 
a nucleotide sequence substantially corresponding to the nucleotide sequence of yH2; and 

an inserted human kappa light chain Ig locus, the human kappa light chain Ig locus 
comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 
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12. A transgenic non-human mammal having a genome, the genome comprising 
modifications, the modifications comprising: 

an inactivated endogenous heavy chain immunoglobulin (Ig) locus; 

an inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising 
a nucleotide sequence substantially corresponding to the nucleotide sequence of yH2; and 

an inserted human kappa light chain Ig locus, the human kappa light chain Ig locus 
comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

13. A transgenic non-human mammal having a genome, the genome comprising 
modifications, the modifications comprising: 

an inactivated endogenous heavy chain immunoglobulin (Ig) locus; 
an inactivated endogenous kappa light chain Ig locus; 

an inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising 
a nucleotide sequence substantially corresponding to the nucleotide sequence of yH2 without 
the presence of a human gamma-2 constant region; and 

an inserted human kappa light chain Ig locus, the human kappa light chain Ig locus 
comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

14. A transgenic non-human mammal having a genome, the genome comprising 
modifications, the modifications comprising: 

an inactivated endogenous heavy chain immunoglobulin (Ig) locus; 
an inactivated endogenous kappa light chain Ig locus; 

an inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising 
a nucleotide sequence substantially corresponding to the nucleotide sequence of yH2 without 
the presence of a human gamma-2 constant region; and 

an inserted human kappa light chain Ig locus, the human kappa light chain Ig locus 
comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

15. A transgenic non-human mammal having a genome, the genome comprising 
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modifications, the modifications comprising: 

an inactr-tted endogenous heavy chain immunoglobulin (Ig) locus; 

an inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising 
a nucleotide sequence substantially corresponding to the nucleotide sequence of yH2 without 
the presence of a human gamma-2 constant region; and 

an inserted human kappa light chain Ig locus, the human kappa light chain Ig locus 
comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

16. A method for the production of human antibodies, comprising: 
inoculating a mammal of any one of Claims 1-10 with an antigen; 

collecting and immortalizing lymphocytic cells to obtain immortal cell lines secreting 
human antibodies that specifically bind to the antigen with an affinity of greater than 10 9 M* 1 ; 
and 

isolating the antibodies from the immortal cell lines. 

1 7. The method of Claim 1 1 , wherein the antigen is EL-8. 

18. The method of Claim 1 1 , wherein the antigen is EGFR. 

19. The method of Claim 1 1 , wherein the antigen is TNF-a. 

20. An antibody produced by the method of Claim 1 1 . 

21. An anti-IL-8 antibody produced by the method of Claim 12. 

22. An anti-EGFR antibody produced by the method of Claim 13. 

23. An anti-TNF-a antibody produced by the method of Claim 14. 

24. In a method for the production of transgenic mice, the transgenic mice having a 
genome, the genome comprising modifications, the modifications comprising insertion of a plurality 
of human variable regions, the improvement comprising: 

insertion of the human variable regions from a yeast artificial chromosome. 

25. Transgenic mice and transgenic offspring therefrom produced through use of the 
improvement of Claim 19. 

26. In a transgenic mammal, the transgenic mammal comprising a genome, the genome 
comprising modifications, the modifications comprising an inserted human heavy chain 
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immunoglobulin transgene, the improvement comprising: 

the transgene comprising selected sets of human variable region genes that enable 
human-like junctional diversity and human-like complementarity determining region 3 (CDR3) 
lengths. 

27. In the improvement of Claim 26, wherein the human-like junctional diversity comprises 
average N-addition lengths of 7.7 bases. 

28. In the improvement of Claim 26, wherein the human-like CDR3 lengths comprise 
between about 2 through about 25 residues with an average of about 14. 
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FIGURE 3 
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